Skip to content

tuba89/ResearchForge-AI

Repository files navigation

ResearchForge AI βš›οΈ

System Infographics

ResearchForge AI Google Cloud Flask TailwindCSS License

Research Paper Download PDF

Multi-agent research collaboration platform built with Google Gemini & Agent Development Kit

| Features | Installation | Research Paper |

βš›οΈ Try Live Demo β€’ πŸ“– Read Blog β€’ πŸ’» Kaggle Notebook β€’ πŸ“„ Research Paper


🌟 Overview

ResearchForge AI is an intelligent multi-agent system that automates the entire research collaboration lifecycle. We transform a process that typically takes researchers months of manual effort into a seamless, minute-long conversation. By coordinating eight specialized AI agents, ResearchForge finds relevant papers, identifies ideal collaborators using ML-powered matching, and generates professional proposals and outreach emailsβ€”all through a natural language interface.

Built for the 5-Day AI Agents Intensive capstone project, it demonstrates advanced Agent-to-Agent (A2A) communication, real-world API integration, and production-ready deployment.

🎯 The Problem

Researchers waste countless hours on the logistics of collaboration instead of doing the research itself. Manually sifting through academic databases, assessing potential partners, and drafting proposals is slow, inefficient, and limits the scale and diversity of potential collaborations.

πŸ’‘ Our Solution

A sophisticated multi-agent system where each agent has a specialized role:

  • DataScout: Finds real papers via live arXiv API
  • ProfileBuilder: Constructs structured researcher profiles
  • MatchEngine: Uses FAISS + Sentence Transformers for ML-powered matching
  • Explainer: Provides clear reasoning behind recommendations
  • ProposalGenerator: Creates funding-ready research proposals
  • OutreachSpecialist: Drafts personalized collaboration emails
  • MemoryCurator: Learns user preferences over time
  • QualityEvaluator: Assesses match quality and system performance

An Orchestrator agent intelligently routes requests to the right specialist, enabling complex workflows like: "Search for papers on medical imaging, find me the top 3 collaborators, and draft a proposal."

🧠 How A2A Works (Simple Visual Trace)

Agents Diagram


πŸ†š ResearchForge AI vs ChatGPT

While ChatGPT is a powerful language model, ResearchForge AI extends its capabilities through real data pipelines, agent-to-agent communication, and ML-powered analysis.
The comparison below highlights the functional differences:

Capability ChatGPT Alone ResearchForge AI
Paper search image cannot access real APIs image live arXiv + researcher data
Verifiable citations image may generate fabricated references image real paper IDs + metadata
Persistent memory image limited across turns image session-level memory curator
Matching collaborators image no embeddings or FAISS image ML similarity + scoring
Agent collaboration image single-model reasoning only image eight autonomous agents using A2A
Proposal generation image generic text generation image proposals built from real retrieved sources
End-to-end workflow image requires manual steps image fully automated research pipeline

ResearchForge AI operates as a full platform, not just a model prompt.
ChatGPT serves as one component inside a broader, intelligent system.

✨ Features

πŸ” Smart Paper Discovery

  • Search millions of research papers from arXiv in real-time
  • Advanced filtering by category (AI, ML, CV, NLP, Robotics, etc.)
  • Instant results with complete metadata (authors, dates, abstracts)
  • Direct PDF downloads and arXiv page links

πŸ€– AI Research Assistant

  • Intelligent chat interface powered by Google Gemini
  • Multi-model fallback system for reliability (gemini-2.0-flash-exp, gemini-2.5-flash-lite, etc.)
  • Context-aware conversations with session management
  • Markdown-formatted responses with beautiful styling

πŸ“ Automated Content Generation

  • Generate comprehensive research proposals in seconds
  • Draft professional collaboration emails
  • Customizable templates with smart defaults
  • Professional formatting and structure

🎨 Modern User Experience

  • Beautiful gradient UI with glassmorphism effects
  • Responsive design (mobile, tablet, desktop)
  • Real-time typing indicators
  • Smooth animations and transitions
  • Dark mode code blocks with syntax highlighting

πŸ“Έ Screenshots

Hero Section

Hero Section

_Modern landing page with gradient design and intelligent agent orchestration_

AI Chat Interface

Chat Interface

_Real-time research assistant powered by 8 specialized AI agents_

Search & Export Features

Search and Export

_Advanced paper search with BibTeX export and automatic search history_

πŸ† Competition: Agents for Good

Track: Education & Research
Competition: 5-Day AI Agents Intensive Capstone
Course: Google Kaggle AI Agents Intensive

Why "Agents for Good"?

ResearchForge democratizes research collaboration:

image Helps researchers at underserved institutions access elite collaboration opportunities
image Reduces barriers to international collaboration
image Automates tedious logistics so researchers focus on discovery
image Accelerates scientific progress through better matching

Key Concepts Demonstrated

  • image Multi-Agent Systems: 8 specialized agents + orchestrator
  • image A2A Protocol: Agent-to-agent communication via Google ADK
  • image Real-World Integration: Live arXiv API, not synthetic data
  • image ML-Powered Tools: FAISS vector search + semantic embeddings
  • image Context Engineering: Proactive agent behaviors and memory
  • image Production Deployment: Google Cloud with 100+ concurrent users
  • image Observability: Comprehensive logging and performance metrics

πŸ““ Kaggle Notebook

View Complete Implementation on Kaggle β†’

The notebook includes:

  • image Full agent system code (V1 and V2 implementations)
  • image Interactive demos with real arXiv data
  • image Step-by-step explanations of each agent
  • image Architecture diagrams and visualizations
  • image Performance metrics and validation tests
  • image Context extraction and memory demonstrations

⭐ Please upvote if you find it helpful!


πŸ“„ Research Paper

Read our full research paper:

πŸ“– View Paper Online | πŸ“₯ Download PDF

Our paper details:

  • Multi-agent system architecture
  • Agent-to-Agent (A2A) communication protocol
  • ML-powered matching algorithms
  • Real-world deployment and performance metrics
  • Comparison with traditional approaches

πŸ—οΈ System Architecture

Architecture Diagram

Technology Stack

Layer Technologies
AI Models Google Gemini 2.0 Flash, Gemini 2.5 Flash Lite
Backend Flask, Google ADK, Python 3.12
Frontend HTML5, TailwindCSS, Vanilla JavaScript
ML/AI Tools FAISS, SentenceTransformers, arXiv API
Deployment Google Cloud App Engine (optional)
Observability Python logging, structured metrics

πŸ“¦ Installation

Prerequisites

  • Python 3.12+
  • Google API key for Gemini (Get one here)
  • (Optional) Google Cloud account for deployment

Quick Start

  1. Clone the repository
git clone https://github.com/yourusername/ResearchForge-AI.git
cd ResearchForge-AI
  1. Create virtual environment
python -m venv venv

# On macOS/Linux:
source venv/bin/activate

# On Windows:
venv\Scripts\activate
  1. Install dependencies
pip install -r requirements.txt
  1. Set up environment variables
# Copy template
cp .env.template .env

# Edit .env and add your credentials:
# GOOGLE_API_KEY=your_api_key_here
  1. Run the application
python app.py
  1. Open in browser
http://localhost:8080

πŸš€ Deployment

Option 1: Local/VPS Deployment

# Production mode
export FLASK_ENV=production
gunicorn -w 4 -b 0.0.0.0:8080 app:app

Option 2: Google Cloud App Engine

  1. Install Google Cloud SDK

    # macOS
    brew install google-cloud-sdk
    
    # Or download from: https://cloud.google.com/sdk/docs/install
  2. Initialize project

    gcloud init
    gcloud config set project YOUR_PROJECT_ID
  3. Update app.yaml

    runtime: python312
    env_variables:
      GOOGLE_API_KEY: "your-api-key"
  4. Deploy

    gcloud app deploy
  5. View your app

    gcloud app browse

For detailed deployment instructions, see DEPLOY.md


πŸ”§ Configuration

Environment Variables

Create a .env file in the project root:

# Required
GOOGLE_API_KEY=your_google_api_key_here

# Optional
GOOGLE_CLOUD_PROJECT=your_project_id
GOOGLE_CLOUD_LOCATION=us-central1
SECRET_KEY=your_secret_key_for_flask_sessions
PORT=8080

API Endpoints

Endpoint Method Description
/ GET Main application UI
/api/search POST Search arXiv papers
/api/chat POST Chat with AI agents
/api/export-bibtex POST Export papers to BibTeX
/api/search-history GET Get search history
/api/search-history POST Save search
/api/search-history DELETE Clear history
/api/health GET Health check
/api/agent-status GET Agent status

πŸ“š Usage Examples

Web Interface

  1. Search Papers: Enter keywords like "quantum computing" or "deep learning"
  2. Chat with AI: Ask questions like:
    • "Find papers about medical imaging AI"
    • "Generate a research proposal for climate change"
    • "Draft a collaboration email for my project"

API Usage

Search Papers:

curl -X POST http://localhost:8080/api/search \
  -H "Content-Type: application/json" \
  -d '{
    "query": "transformer models",
    "category": "cs.CL",
    "max_results": 5
  }'

Chat with Agent:

curl -X POST http://localhost:8080/api/chat \
  -H "Content-Type: application/json" \
  -d '{
    "message": "Find recent papers on reinforcement learning",
    "session_id": "user-123"
  }'

Example: Export to BibTeX

curl -X POST http://localhost:8080/api/export-bibtex \\
-H "Content-Type: application/json" \\
-d '{
"papers": [
{
"title": "Attention Is All You Need",
"authors": ["Vaswani et al."],
"arxiv_id": "1706.03762",
"published": "2017-06-12",
"abstract": "The dominant sequence transduction models..."
}
]
}'

🎯 Project Structure


ResearchForge-AI/
β”œβ”€β”€ main.py                               # Flask app with search history API
β”œβ”€β”€ agent.py                              # 8 agents + BibTeX export
β”œβ”€β”€ chat_interface.py                     # Session management
β”œβ”€β”€ requirements.txt                      # Python dependencies
β”œβ”€β”€ .env.template                         # Environment template
β”œβ”€β”€ .gitignore                            # Git ignore rules
β”‚
β”œβ”€β”€ templates/
β”‚ └── index.html                          # UI with export & history
β”‚
β”œβ”€β”€ static/
β”‚ β”œβ”€β”€ images/
β”‚ β”‚ β”œβ”€β”€ screenshots/                      #  App screenshots
β”‚ β”‚ β”‚ β”œβ”€β”€ hero_section.png
β”‚ β”‚ β”‚ β”œβ”€β”€ chat_interface.png
β”‚ β”‚ β”‚ └── search_export.png
β”‚ β”‚ β”œβ”€β”€ System_Infographics.png
β”‚ β”‚ └── ResearchForge_Diagram_dark.png
β”‚ β”œβ”€β”€ css/
β”‚ β”‚ └── agent-dashboard.css                # Gradient animations
β”‚ └── js/
β”‚ └── app.js                               # Export & history logic
β”‚
β”œβ”€β”€ docs/
β”‚ β”œβ”€β”€ DEPLOY.md
β”‚ └── CONTRIBUTING.md
β”‚
└── README.md


πŸ§ͺ Testing

Run Local Tests

# Test agent functionality
python test_agent.py

# Test API endpoints
python -m pytest tests/

Test Queries

Try these in the chat interface:

  1. "Find papers about transformer models in NLP"
  2. "Generate a proposal for AI in education"
  3. "Draft an email for my healthcare AI project"

🀝 Contributing

We welcome contributions! Here's how to get started:

  1. Fork the repository
  2. Create a feature branch (git checkout -b feature/AmazingFeature)
  3. Commit your changes (git commit -m 'Add AmazingFeature')
  4. Push to the branch (git push origin feature/AmazingFeature)
  5. Open a Pull Request

See CONTRIBUTING.md for detailed guidelines.


πŸ› Known Issues & Limitations

  • API Rate Limits: Free Gemini API has rate limits; multi-model fallback helps
  • arXiv Scope: Only searches arXiv (not all academic databases)
  • Session Persistence: In-memory sessions (lost on restart)

See Issues for planned improvements.


πŸ“Š Performance

  • Paper Search: < 2 seconds (arXiv API)
  • AI Response: 2-5 seconds (Gemini API)
  • Proposal Generation: < 3 seconds
  • Concurrent Users: 100+ (with gunicorn)

πŸ“„ License

This project is licensed under the MIT License - see the LICENSE file for details.


πŸ™ Acknowledgments

  • Google AI for the 5-Day Agents Intensive course
  • Google Gemini team for the incredible AI capabilities
  • arXiv.org for providing open access to research papers
  • Semantic Scholar for researcher data APIs

πŸ“§ Contact

Built by Assia, Ariamehr and Chukwuebuka for the Agents Intensive Capstone Project


πŸ‘₯ Team

Assia Benkedia -

Chukwuebuka Okeke -

Ariamehr Maleki -


⭐ If you find this project useful, please consider giving it a star!

Made with ❀️ and βš›οΈ using Google Gemini & Agent Development Kit

⬆ Back to Top

About

AI-powered research collaboration platform using Google Gemini. Find papers, generate proposals, and connect researchers in minutes.

Topics

Resources

License

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors