publications
Publications by category in reverse chronological order. Generated by jekyll-scholar.
2025
- Magma: A Foundation Model for Multimodal AI Agents. 2025
- QuantRad: Advancing Quantitative Reliability in Radiology Report Generation with Cascaded Decoders. arXiv preprint arXiv:x, 2025
- LLaVA-Rad MIMIC-CXR Annotations. arXiv preprint arXiv:x, 2025
- Universal Abstraction: Harnessing Frontier Models to Structure Real-World Data at Scale. arXiv preprint arXiv:2502.00943, 2025
2024
- UniversalNER: Targeted distillation from large language models for open named entity recognition. In ICLR, 2024
- BiomedJourney: Counterfactual biomedical image generation by instruction-learning from multimodal patient journeys. arXiv preprint arXiv:2310.10765, 2024
- Foundation Models for Biomedical Image Segmentation: A Survey. arXiv preprint arXiv:2401.07654, 2024
- Training small multimodal models to bridge biomedical competency gap: A case study in radiology imaging. CoRR, 2024
- BiomedParse: a biomedical foundation model for image parsing of everything everywhere all at once. arXiv preprint arXiv:2405.12971, 2024
- A whole-slide foundation model for digital pathology from real-world data. Nature, 2024
- MedImageInsight: An open-source embedding model for general domain medical imaging. arXiv preprint arXiv:2410.06542, 2024
- A foundation model for joint segmentation, detection and recognition of biomedical objects across nine modalities. Nature Methods, 2024
2023
- Fine-tuning large neural language models for biomedical natural language processing. Cell Press Patterns, 2023
- Toward structuring real-world data: Deep learning for extracting oncology information from clinical text with patient-level supervision. Patterns, 2023
- Distilling large language models for biomedical knowledge extraction: A case study on adverse drug events. arXiv preprint arXiv:2307.06439, 2023
- Interactive Span Recommendation for Biomedical Text. In ACL - Proceedings of the 5th Clinical Natural Language Processing Workshop, 2023
- Toward structuring real-world data: Deep learning for extracting oncology information from clinical text with patient-level supervision. Cell Press Patterns, 2023
- Scaling clinical trial matching using large language models: a case study in oncology. In Machine Learning for Healthcare Conference, 2023
- KoLA: Carefully benchmarking world knowledge of large language models. arXiv preprint arXiv:2306.09296, 2023
2022
- Domain-specific language model pretraining for biomedical natural language processing. ACM, 2022
2021
- Domain-specific language model pretraining for biomedical natural language processing. ACM Transactions on Computing for Healthcare (HEALTH), 2021
2020
- Extracting medications and associated adverse drug events using a natural language processing system combining knowledge base and deep learning. Journal of the American Medical Informatics Association, 2020
- Clinical concept normalization with a hybrid natural language processing system combining multilevel matching and machine learning ranking. Journal of the American Medical Informatics Association, 2020
- Domain-Specific Language Model Pretraining for Biomedical Natural Language Processing. arXiv preprint arXiv:2007.15779, 2020
- Domain-specific language model pretraining for biomedical natural language processing. arXiv preprint, 2020
2019
- Clinical trial cohort selection based on multi-level rule-based natural language processing system. Journal of the American Medical Informatics Association, 2019
2017
- Symptom severity classification with gradient tree boosting. Journal of Biomedical Informatics, 2017