Paper-Agent: Intelligent Academic Survey Report Generation System

Languages: English · 简体中文

📖 Introduction

Paper-Agent is an automated survey report generation system for researchers, designed to address the pain points of "time-consuming and shallow analysis" in academic paper research. It is not a simple literature summarization tool, but an intelligent domain research assistant with full-process capabilities of "retrieval-reading-analysis-synthesis-report" that can generate in-depth and insightful domain survey reports.

📸 Project Preview

Click to enlarge screenshots

Screenshot 1	Screenshot 2	Screenshot 3

Screenshot 4	Screenshot 5	Screenshot 6

✨ Core Features

🤖 Multi-Agent Collaboration Architecture: Based on the AutoGen framework, adopting a multi-agent collaboration model covering retrieval, reading, analysis, writing, and other agents to automatically collaborate on complex tasks
📚 Intelligent Literature Retrieval: Converts natural language queries into precise search conditions with manual review support, retrieves relevant academic papers from arXiv
🔍 Structured Information Extraction: Intelligent reading extracts core problems, technical approaches, experimental results, datasets, limitations, and other key information from papers, outputting standardized JSON structures
🧠 In-Depth Domain Analysis: Through three-stage processes of cluster analysis, deep analysis, and global analysis, identifies research trends and emerging topics
✍️ Domain Survey Report Generation: Integrates analysis results into academically structured reports with clear logic, supporting Markdown format output
🔄 Real-time Streaming Output: Based on SSE (Server-Sent Events) technology, pushes task progress to the frontend in real-time
⚡ Parallel Processing Optimization: Supports parallel paper reading, parallel cluster analysis, and parallel chapter writing, significantly improving processing efficiency
🔧 Modular Design: Decoupled functional modules, built on LangGraph for workflow, easy to extend and maintain
💾 Vector Database Support: Uses ChromaDB to store extracted paper information, supporting retrieval-augmented writing
👥 User Interaction Review: Introduces manual review at key steps to ensure query conditions and generated content meet expectations

System Architecture

Here is a brief introduction. For more detailed information about system architecture, node implementation, and agent collaboration, please refer to the design.md document.

Paper-Agent adopts a modular design, builds a complete workflow based on LangGraph, and works through six core nodes:

Core Nodes

search_agent_node (Paper Search Node)
- Uses LLM to convert user natural language requirements into structured query conditions
- Performs manual review through user proxy (userProxyAgent)
- Calls PaperSearcher to retrieve relevant papers from arXiv
- Supports query conditions: querys, start_date, end_date
reading_agent_node (Paper Reading Node)
- Processes multiple papers in parallel, extracting core information from each paper
- Extracts according to predefined models: core problem, key methods, datasets, evaluation metrics, main results, limitations, contributions
- Stores extracted results in vector database for subsequent retrieval augmentation
analyse_agent_node (Paper Analysis Node)
- PaperClusterAgent: Uses embedding vectors and KMeans algorithm for paper clustering, automatically determining cluster count
- DeepAnalyseAgent: Performs in-depth analysis on each cluster, including technical approaches, method comparisons, application domains, etc.
- GlobalanalyseAgent: Summarizes all cluster analysis results to generate a global analysis report containing six major modules
writing_agent_node (Writing Node)
- writing_director_node: Generates report outline and splits it into writing sub-tasks based on user requirements and global analysis
- parallel_writing_node: Executes all writing sub-tasks in parallel, using multi-agent collaboration to complete chapter writing
- Supports retrieval-augmented writing and quality review
report_agent_node (Report Generation Node)
- Summarizes all written chapters to generate a complete survey report
- Outputs in Markdown format, automatically adding transitional sentences
- Streaming output, pushing generation progress in real-time

Sub-Agent Architecture

Writing Module Sub-Agents

writing_agent: Responsible for writing chapter content based on sub-tasks
retrieval_agent: Retrieves relevant content from vector database to supplement writing materials
review_agent: Reviews writing content quality, outputs "APPROVE" to terminate sub-task upon approval

Analysis Module Sub-Agents

PaperClusterAgent: Paper cluster analysis, generates topic
DeepAnalyseAgent: In-depth analysis of single descriptions and keywords clusters
GlobalanalyseAgent: Global analysis, generates six-module report

Workflow Architecture

Orchestrator Module
- Builds complete workflow based on LangGraph
- Coordinates orderly execution of nodes
- Manages global state and error handling
- Pushes task progress to frontend in real-time via SSE
State Management
- Uses State to manage global state
- Implements frontend-backend communication through queues
- Supports real-time state push

Workflow

The system builds a complete workflow based on LangGraph and completes survey report generation through six core nodes:

Complete Process

Input Query: User provides research topic or question
Paper Retrieval: System automatically generates query conditions with manual review support, retrieves relevant papers from arXiv
Paper Reading: Processes multiple papers in parallel, extracts and structures core information
In-Depth Analysis:
- Cluster Analysis: Groups papers by topic
- Deep Analysis: Performs in-depth analysis on each cluster including technical approaches, method comparisons, etc.
- Global Analysis: Summarizes all cluster results to generate six-module report
Content Generation:
- Generate Outline: Generates report outline based on user requirements and global analysis
- Task Splitting: Parses outline into parallel executable writing sub-tasks
- Parallel Writing: Uses multi-agent collaboration to complete chapter writing
Report Integration: Summarizes all chapters to generate complete Markdown format survey report

Key Features

Real-time Streaming Output: Based on SSE technology, pushes task progress to frontend in real-time
Parallel Processing Optimization: Parallel paper reading, parallel cluster analysis, parallel chapter writing
User Interaction Review: Introduces manual review at key steps
Retrieval-Augmented Writing: Retrieves relevant content from vector database to supplement writing materials
Quality Review Mechanism: review_agent reviews writing content quality

📂 Directory Structure

Paper-Agents/
├── main.py                 # Application main entry, FastAPI application initialization
├── pyproject.toml          # Python project configuration and dependency declaration
├── LICENSE                 # MIT license file
├── README.md               # English documentation
├── .gitignore              # Git ignore file
│
├── docs/                   # Documentation directory
│   ├── README_cn.md        # Chinese documentation
│   └── design.md           # System design document
│
├── src/                    # Source code directory
│   ├── agents/             # Agent module
│   │   ├── orchestrator.py         # Workflow orchestrator
│   │   ├── search_agent.py         # Paper search agent
│   │   ├── userproxy_agent.py      # User review agent
│   │   ├── reading_agent.py        # Paper reading agent
│   │   ├── analyse_agent.py        # Paper analysis agent
│   │   ├── writing_agent.py        # Content writing agent
│   │   ├── report_agent.py         # Report generation agent
│   │   ├── sub_analyse_agent/      # Sub-analysis agent directory
│   │   │   ├── cluster_agent.py           # Paper cluster agent
│   │   │   ├── deep_analyse_agent.py      # Paper deep analysis agent
│   │   │   └── global_analyse_agent.py    # Global analysis agent
│   │   └── sub_writing_agent/      # Sub-writing agent directory
│   │       ├── writing_director_agent.py    # Writing director agent
│   │       ├── parallel_writing_node.py     # Parallel writing node
│   │       ├── writing_agent.py             # Chapter writing agent
│   │       ├── retrieval_agent.py           # Retrieval augmentation agent
│   │       ├── review_agent.py              # Quality review agent
│   │       ├── writing_chatGroup.py         # Writing collaboration group
│   │       └── writing_state_models.py      # Writing state models
│   │
│   ├── core/               # Core module
│   │   ├── config.py        # Configuration management
│   │   ├── model_client.py  # Model client
│   │   ├── models.yaml      # Model configuration
│   │   ├── prompts.py       # Prompt templates
│   │   └── state_models.py  # State model definitions
│   │
│   ├── services/           # Service layer
│   │   ├── arxiv_client.py           # arXiv API client
│   │   ├── arxiv_fetcher.py          # arXiv paper fetcher
│   │   ├── chroma_client.py          # Chroma vector database client
│   │   └── retrieval_tool.py         # Retrieval tool
│   │
│   ├── tasks/              # Task module
│   │   ├── deduplicator.py      # Paper deduplication (planned)
│   │   ├── paper_downloader.py  # Paper download (planned)
│   │   ├── paper_filter.py      # Paper filtering (planned)
│   │   ├── paper_search.py      # Paper search
│   │   └── papers/              # Paper storage directory (planned)
│   │
│   └── utils/              # Utility functions
│       └── log_utils.py    # Logging utilities
│
├── test/                   # Test directory
│   ├── test_analyseAgent.py    # Analysis agent test
│   ├── test_readingAgent.py    # Reading agent test
│   ├── test_searchAgent.py     # Search agent test
│   ├── test_writingAgent.py    # Writing agent test
│   └── test_workflow.py        # Workflow test
│
├── web/                    # Frontend directory
│   ├── index.html          # Frontend entry page
│   ├── package.json        # Frontend dependency configuration
│   ├── src/                # Frontend source code
│   └── vite.config.js      # Vite configuration
│
├── data/                   # Data storage directory
└── output/                 # Output directory
    └── log/                # Log output directory


## 🚀 Quick Start

1. **Environment Preparation**
   - Python 3.12+
   - The project uses poetry to manage virtual environments
   - Install dependencies: `poetry install`

2. **Configure Environment**
   - Copy `.env.example` to `.env` and fill in your API key
   - Modify parameters in `models.yaml`

3. **Run System**
   ```bash
   poetry run python main.py

Web Interface
```
cd web && npm install && npm run dev
```
- Access http://localhost:5173 to use the web interface

Configuration Guide

Environment Variable Configuration

Set API keys and related configurations in the .env file:

# Model provider API key
OPENAI_API_KEY=your_openai_api_key
# Or other provider's API key

Model Configuration

System configuration file is located in models.yaml. Adjust the following parameters as needed:

Optional Model Providers

OpenAI
Other compatible LLM providers

Default model and embedding model configuration used by the project

Default LLM model
Default embedding model
Model parameters (temperature, max_tokens, etc.)

Model and embedding model configuration specifically used by project modules (optional)

search_agent: Search-specific model
reading_agent: Reading-specific model
analyse_agent: Analysis-specific model
writing_agent: Writing-specific model
report_agent: Report generation-specific model
Embedding model configuration for each module

API keys and base URLs for each model provider

API key configuration
Base URL configuration
Other connection parameters

Configuration Example

# models.yaml example
default:
  model-provider: "openai"
  model: "gpt-4"
  embedding-model: "text-embedding-3-large"
  embedding-dimension: 1024

modules:
  search_agent:
    model-provider: "openai"
    model: "gpt-3.5-turbo"
  reading_agent:
    model-provider: "openai"
    model: "gpt-4"
  analyse_agent:
    model-provider: "openai"
    model: "gpt-4"
  writing_agent:
    model-provider: "openai"
    model: "gpt-4"
  report_agent:
    model-provider: "openai"
    model: "gpt-4"

openai:
  api-key: ${OPENAI_API_KEY}
  base-url: https://api.openai.com/v1

Tech Stack

Backend

Programming Language: Python 3.12+
Agent Framework:
- AutoGen: Multi-agent collaboration framework
- LangGraph: Workflow orchestration framework
Web Framework: FastAPI, Uvicorn
Real-time Communication: SSE (Server-Sent Events)
Vector Database: ChromaDB
Data Processing: pyyaml, python-dotenv, tenacity
Machine Learning:
- scikit-learn: KMeans clustering, elbow method
- numpy: Vector computation
Paper Retrieval: arXiv API
Network Requests: requests, aiohttp
Package Management: Poetry
Logging System: Python standard library logging module (custom configuration)

Frontend

Framework: Vue.js 3.4+
Build Tool: Vite 5.0+
Development Tool: @vitejs/plugin-vue

Contributing Guide

We welcome contributions in various forms, including but not limited to:

Submit issues to report bugs or suggest new features
Submit pull requests to improve code
Improve documentation

Please read CONTRIBUTING.md for more details.

License

This project uses MIT license. See LICENSE file for details.

Contact

If you have any questions or suggestions, please provide feedback through:

GitHub Issues: Please submit Issues in the project repository, this is the most recommended way to report issues
Project Homepage: https://github.com/Tswoen/paper-agent

⭐ If this project is helpful to you, please give us a star to show your support!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Paper-Agent: Intelligent Academic Survey Report Generation System

📖 Introduction

📸 Project Preview

✨ Core Features

System Architecture

Core Nodes

Sub-Agent Architecture

Workflow Architecture

Workflow

Complete Process

Key Features

📂 Directory Structure

Configuration Guide

Environment Variable Configuration

Model Configuration

Configuration Example

Tech Stack

Backend

Frontend

Contributing Guide

License

Contact

Star History

About

Uh oh!

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 89 Commits
docs		docs
src		src
test		test
web		web
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
design.md		design.md
example.env		example.env
main.py		main.py
pyproject.toml		pyproject.toml

Folders and files

Latest commit

History

Repository files navigation

Paper-Agent: Intelligent Academic Survey Report Generation System

📖 Introduction

📸 Project Preview

✨ Core Features

System Architecture

Core Nodes

Sub-Agent Architecture

Workflow Architecture

Workflow

Complete Process

Key Features

📂 Directory Structure

Configuration Guide

Environment Variable Configuration

Model Configuration

Configuration Example

Tech Stack

Backend

Frontend

Contributing Guide

License

Contact

Star History

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages