Release the medical dataset generated by KISA #5

@IcecreamArtist

Hello,

I'm very interested in your excellent work on MedResearcher-R1. Your paper describes a medical trajectory dataset generated through KISA from 30M+ PubMed abstracts, which appears to be a key contribution to achieving the impressive results on MedBrowseComp.

I noticed you've also open-sourced a high-quality QA dataset within your GitHub repository (specifically TrajectoryGenerationPipeline/qa_data/open_data.jsonl). However, upon examination, this dataset appears to focus on general-domain questions rather than medical-specific content, and may not be the actual dataset used for training MedResearcher-R1.

If you're willing, could you please release the dataset used to train MedResearcher-R1? Please let me know if I have misunderstood anything.

Thank you very much!

Sincerely,
IcecreamArtist
