publications
Publications in reverse chronological order. Generated by jekyll-scholar.
2025
- Genome-wide association study of prostate-specific antigen levels in 392,522 men identifies new loci and improves prediction across ancestry groups. Nature genetics, 2025
- Enabling end-to-end secure federated learning in biomedical research on heterogeneous computing environments with APPFLx. Computational and Structural Biotechnology Journal, 2025
- Generation of synthetic whole-slide image tiles of tumours from RNA-sequencing data via cascaded diffusion models. Nature Biomedical Engineering, 2025
- Advances in appfl: A comprehensive and extensible federated learning framework. In 2025 IEEE 25th International Symposium on Cluster, Cloud and Internet Computing (CCGrid), 2025
- scPrediXcan integrates advances in deep learning and single-cell data into a powerful cell-type–specific transcriptome-wide association study framework. bioRxiv, 2025
- Evaluating Multi-Ancestry Genome-Wide Association Methods: Statistical Power, Population Structure, and Practical Implications. medRxiv, 2025
- Pathology Image Compression with Pre-trained Autoencoders. In International Conference on Medical Image Computing and Computer-Assisted Intervention, 2025
- Genome-Wide Assessment of Pleiotropy Across >1000 Traits from Global Biobanks. medRxiv, 2025
- Evaluating Vision and Pathology Foundation Models for Computational Pathology: A Comprehensive Benchmark Study. medRxiv, 2025
- Benchmarking accelerated next-generation sequencing analysis pipelines. Bioinformatics Advances, 2025
- scPrediXcan integrates deep learning methods and single-cell data into a cell-type-specific transcriptome-wide association study framework. Cell Genomics, 2025
- Genome-wide association study of long COVID. Nature Genetics, 2025
- AIDRIN 2.0: A Framework to Assess Data Readiness for AI. arXiv preprint arXiv:2505.18213, 2025
- FedCostAware: Enabling Cost-Aware Federated Learning on the Cloud. arXiv preprint arXiv:2505.21727, 2025
- CADRE: Customizable Assurance of Data Readiness in Privacy-Preserving Federated Learning. arXiv preprint arXiv:2505.23849, 2025
- PixCell: A generative foundation model for digital histopathology images. arXiv preprint arXiv:2506.05127, 2025
- Enhancing Clinical Models with Pseudo Data for De-identification. arXiv preprint arXiv:2506.12674, 2025
- CDG-MAE: Learning Correspondences from Diffusion Generated Views. arXiv preprint arXiv:2506.18164, 2025
- Evaluating Vision and Pathology Foundation Models for Computational Pathology: A Comprehensive Benchmark Study. Research Square, 2025
- Closing the loop: Teaching single-cell foundation models to learn from perturbations. bioRxiv, 2025
- Increasing Value in the Veterans Affairs Healthcare System (VA) with Precision Health: A Continuing Landmark Collaboration with the Department of Energy. medRxiv, 2025
- Leveraging Open-Source Large Language Models to Identify Undiagnosed Patients with Rare Genetic Aortopathies. medRxiv, 2025
- Cost-Aware Federated Learning on the Cloud. In 2025 IEEE International Conference on eScience (eScience), 2025
- Experiences Building Enterprise-Level Privacy-Preserving Federated Learning to Power AI for Science. arXiv preprint arXiv:2511.08998, 2025
- AIDRIN: A Comprehensive Toolset for Automating Data Preparation for AI. 2025
2024
- Diversity and scale: Genetic architecture of 2068 traits in the VA Million Veteran Program. Science, 2024
- Adaption and national validation of a tool for predicting mortality from other causes among men with nonmetastatic prostate cancer. European Urology Oncology, 2024
- Secure Federated Learning Across Heterogeneous Cloud and High-Performance Computing Resources: A Case Study on Federated Fine-Tuning of LLaMA 2. Computing in Science & Engineering, 2024
- MaPPeRTrac: A Massively Parallel, Portable, and Reproducible Tractography Pipeline. Neuroinformatics, 2024
- Accelerating Genome-and Phenome-Wide Association Studies using GPUs–A case study using data from the Million Veteran Program. bioRxiv, 2024
- Transcriptome-wide association analysis identifies candidate susceptibility genes for prostate-specific antigen levels in men without prostate cancer. Human Genetics and Genomics Advances, 2024
- Ai data readiness inspector (aidrin) for quantitative assessment of data readiness for ai. In Proceedings of the 36th International Conference on Scientific and Statistical Database Management, 2024
- Quantized multi-task learning for context-specific representations of gene network dynamics. bioRxiv, 2024
- Advances in privacy preserving federated learning to realize a truly learning healthcare system. In 2024 IEEE 6th International Conference on Trust, Privacy and Security in Intelligent Systems, and Applications (TPS-ISA), 2024
- 54. PREDICTION OF ANTIPSYCHOTIC-INDUCED METABOLIC ADVERSE EFFECTS USING MULTIMODAL ARTIFICIAL INTELLIGENCE. European neuropsychopharmacology, 2024
- A landmark federal interagency collaboration to promote data science in health care: Million Veteran Program-Computational Health Analytics for Medical Precision to Improve Outcomes Now. JAMIA open, 2024
- Crowd-sourced machine learning prediction of long COVID using data from the National COVID Cohort Collaborative. EBioMedicine, 2024
- Federated Learning, Curriculum Learning, and Unbalanced Data in COVID-19 Classification. In AAPM 66th Annual Meeting & Exhibition, 2024
- Gen-sis: Generative self-augmentation improves self-supervised learning. arXiv preprint arXiv:2412.01672, 2024
- Privacy-preserving federated learning for science: Challenges and research directions. In 2024 IEEE International Conference on Big Data (BigData), 2024
- scPrediXcan: A Method for Transcriptome-Wide Association Studies at Cell-type Level Using Deep Learning. In GENETIC EPIDEMIOLOGY, 2024
- Rapid Long-Range Linkage Disequilibrium Calculations at Biobank Scale using GPU Acceleration. In GENETIC EPIDEMIOLOGY, 2024
- Monocyte-endothelial interactions as a targetable node in clonal hematopoiesis-mediated cardiovascular disease. medRxiv, 2024
2023
- Identification of novel, replicable genetic risk loci for suicidal thoughts and behaviors among US military veterans. JAMA psychiatry, 2023
- Genetics of varicose veins reveals polygenic architecture and genetic overlap with arterial and venous disease. Nature Cardiovascular Research, 2023
- MIDRC CRP10 AI interface—an integrated tool for exploring, testing and visualization of AI models. Physics in Medicine & Biology, 2023
- Evidence of novel susceptibility variants for prostate cancer and a multiancestry polygenic risk score associated with aggressive disease in men of African ancestry. European urology, 2023
- Genome-wide association study identifies four pan-ancestry loci for suicidal ideation in the Million Veteran Program. PLoS genetics, 2023
- User experience evaluation for MIDRC AI interface. In Medical Imaging 2023: Imaging Informatics for Healthcare, Research, and Applications, 2023
- Cardiovascular disease risk assessment using traditional risk factors and polygenic risk scores in the million veteran program. JAMA cardiology, 2023
- Evaluating approaches for constructing polygenic risk scores for prostate cancer in men of African and European ancestry. The American Journal of Human Genetics, 2023
- Phecode-enabled GWASs identify known and novel loci for eye traits and disorders (EYEWas). Investigative Ophthalmology & Visual Science, 2023
- RNA-to-image multi-cancer synthesis using cascaded diffusion models. bioRxiv, 2023
- FAIR for AI: An interdisciplinary and international community building perspective. Scientific data, 2023
- Appflx: Providing privacy-preserving cross-silo federated learning as a service. In 2023 IEEE 19th international conference on e-science (e-science), 2023
- FedCompass: efficient cross-silo federated learning on heterogeneous client devices using a computing power aware scheduler. arXiv preprint arXiv:2309.14675, 2023
- GWAS meta-analysis of suicide attempt: identification of 12 genome-wide significant loci and implication of genetic risks for specific health factors. American journal of psychiatry, 2023
- Characterizing prostate cancer risk through multi-ancestry genome-wide discovery of 187 novel risk variants. Nature genetics, 2023
- Correction: Reproducible big data science: A case study in continuous FAIRness. Plos one, 2023
- Reproducible big data science: A case study in continuous FAIRness (vol 14, e0213013, 2019). PLOS ONE, 2023
- GWAS Meta-Analysis of Suicide Attempt. 2023
2022
- A Phenome-Wide Association Study of genes associated with COVID-19 severity reveals shared genetics with complex diseases in the Million Veteran Program. PLoS genetics, 2022
- Dissecting the shared genetic architecture of suicide attempt, psychiatric disorders, and known risk factors. Biological psychiatry, 2022
- Abstract PO-163: Genome-wide polygenic risk score of prostate cancer in African and European ancestry men. Cancer Epidemiology, Biomarkers & Prevention, 2022
- Polygenic predisposition to venous thromboembolism is associated with increased COVID-19 positive testing rates. medRxiv, 2022
- APPFL: open-source software framework for privacy-preserving federated learning. In 2022 IEEE international parallel and distributed processing symposium workshops (IPDPSW), 2022
- Comparison of machine learning and deep learning for view identification from cardiac magnetic resonance images. Clinical imaging, 2022
- Multi-ancestry Genome-wide Association Study of Varicose Veins Reveals Polygenic Architecture, Genetic Overlap with Arterial and Venous Disease, and Novel Therapeutic Opportunities. medRxiv, 2022
- A genome-wide association study of suicide attempts in the million veterans program identifies evidence of pan-ancestry and ancestry-specific risk loci. Molecular psychiatry, 2022
- Validation of a multi-ancestry polygenic risk score and age-specific risks of prostate cancer: A meta-analysis within diverse populations. Elife, 2022
- The case for optimized edge-centric tractography at scale. Frontiers in Neuroinformatics, 2022
- Association of kidney comorbidities and acute kidney failure with unfavorable outcomes after COVID-19 in individuals with the sickle cell trait. JAMA internal medicine, 2022
- A MUC5B gene polymorphism, rs35705950-T, confers protective effects against COVID-19 hospitalization but not severe disease or mortality. American journal of respiratory and critical care medicine, 2022
- Evidence of Novel Susceptibility Variants for Prostate Cancer and a Polygenic Risk Score that Improves Prediction of Aggressive Disease for Men of African Ancestry. In GENETIC EPIDEMIOLOGY, 2022
- MACE prediction using high-dimensional machine learning and mechanistic interpretation: A longitudinal cohort study in US veterans. MedRxiv, 2022
- HIGH-PERFORMANCE COMPUTING MEETS HIGH-PERFORMANCE MEDICINE. In PACIFIC SYMPOSIUM ON BIOCOMPUTING 2023: Kohala Coast, Hawaii, USA, 3–7 January 2023, 2022
- APPFL: OPEN-SOURCE FRAMEWORK FOR PRIVACY-PRESERVING FEDERATED LEARNING. 2022
- Kidney failure with unfavorable outcomes after COVID-19 in individuals with the sickle cell trait. 2022
- Genome-wide association study meta-analysis of suicide attempt identifies twelve genome-wide significant loci and implicates genetic risks for specific health factors. MedRxiv, 2022
2021
- Toward Optimized and Predictive Connectomics at Scale. bioRxiv, 2021
- Abstract LB011: Meta-analysis in more than 80,000 men of African ancestry identified nine novel variants associated with prostate cancer. Cancer Research, 2021
- A MUC5B gene polymorphism, rs35705950-T, confers protective effects in COVID-19 infection. medRxiv, 2021
- Transancestry Genome-wide Association Study of Varicose Veins in More Than 1 Million Individuals Reveals Circulating Effectors of Venous Disease. JVS-Vascular Science, 2021
- Genome-wide Polygenic Risk Score of Prostate Cancer in African and European Ancestry Men. In GENETIC EPIDEMIOLOGY, 2021
- MaPPeRTrac. 2021
- Trans-ancestry Genome-wide Association Study Of Varicose Veins In >1 Million Individuals Reveals Circulating Effectors Of Venous Disease. Arteriosclerosis, Thrombosis, and Vascular Biology, 2021
- A Phenome-Wide Association Study of Genes Associated with COVID-19 Severity Reveals Shared Genetics with Rheumatic Conditions: Abstract Number: 1938. Arthritis & Rheumatology, 2021
2020
- Atlas of transcription factor binding sites from ENCODE DNase hypersensitivity data across 27 tissue types. Cell reports, 2020
- Using the face-it portal and workflow engine for operational food quality prediction and assessment: An application to mussel farms monitoring in the bay of napoli, italy. Future Generation Computer Systems, 2020
- Prevalence of inherited mutations in breast cancer predisposition genes among women in Uganda and Cameroon. Cancer Epidemiology, Biomarkers & Prevention, 2020
- Mappertrac: A massively parallel, portable, and reproducible tractography pipeline. bioRxiv, 2020
2019
- Reproducible big data science: A case study in continuous FAIRness. PloS one, 2019
- Scientific Workflow Use Cases, version 1.1. Scientific Workflow Use Cases, v1.1, 2019
- Genetic deletion of Sphk2 confers protection against Pseudomonas aeruginosa mediated differential expression of genes related to virulent infection and inflammation in mouse lung. BMC genomics, 2019
- Big Data Bags: A Scalable Packaging Format for Science. In RO, 2019
2018
- BDQC: a general-purpose analytics tool for domain-blind validation of Big Data. bioRxiv, 2018
- Expression profiling of genes regulated by Sphingosine kinase 2 in a murine model of Pseudomonas aeruginosa mediated acute lung inflammation. The FASEB Journal, 2018
- O3-03-01: MECHANISTIC AND DIRECTIONAL TRANSCRIPTIONAL REGULATORY NETWORKS IN ALZHEIMER’S DISEASE. Alzheimer’s & Dementia, 2018
2017
- A novel MERTK mutation causing retinitis pigmentosa. Graefe’s Archive for Clinical and Experimental Ophthalmology, 2017
- Developing a framework for digital objects in the Big Data to Knowledge (BD2K) commons: Report from the Commons Framework Pilots workshop. Journal of biomedical informatics, 2017
- Globus: A case study in software as a service for scientists. In Proceedings of the 8th Workshop on Scientific Cloud Computing, 2017
- Cost-aware cloud profiling, prediction, and provisioning as a service. IEEE Cloud Computing, 2017
2016
- Models and simulations as a service: exploring the use of galaxy for delivering computational models. Biophysical journal, 2016
- LINE1 insertions as a genomic risk factor for schizophrenia: Preliminary evidence from an affected family. American Journal of Medical Genetics Part B: Neuropsychiatric Genetics, 2016
- Cover Image, Volume 171B, Number 4, June 2016. American Journal of Medical Genetics Part B: Neuropsychiatric Genetics, 2016
- An automated tool profiling service for the cloud. In 2016 16th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), 2016
- Predictive big data analytics: a study of Parkinson’s disease using large, complex, heterogeneous, incongruent, multi-source and incomplete observations. PloS one, 2016
- Science as a Service [Guest editors’ introduction]. Computing in Science & Engineering, 2016
- I’ll take that to go: Big data bags and minimal identifiers for exchange of large, complex datasets. In Big Data (Big Data), 2016 IEEE International Conference on, 2016
- Applications of the FACE-IT portal and workflow engine for operational food quality prediction and assessment: Mussel farm monitoring in the Bay of Napoli, Italy. In CEUR Workshop Proceedings, 2016
2015
- Consensus Genotyper for Exome Sequencing (CGES): improving the quality of exome variant genotypes. Bioinformatics, 2015
- A case study for cloud based high throughput analysis of NGS data using the globus genomics system. Computational and structural biotechnology journal, 2015
- FACE-IT: A science gateway for food security research. Concurrency and Computation: Practice and Experience, 2015
- The Globus Galaxies platform: delivering science gateways as a service. Concurrency and Computation: Practice and Experience, 2015
- PDACS: a portal for data analysis services for cosmological simulations. Computing in Science & Engineering, 2015
- Big biomedical data as the key resource for discovery science. Journal of the American Medical Informatics Association, 2015
- Scalable pCT image reconstruction delivered as a cloud service. IEEE Transactions on Cloud Computing, 2015
- Cost-aware elastic cloud provisioning for scientific workloads. In 2015 IEEE 8th International Conference on Cloud Computing, 2015
- Cost-aware cloud provisioning. In 2015 IEEE 11th International Conference on e-Science, 2015
2014
- Experiences building Globus Genomics: a next-generation sequencing analysis service using Galaxy, Globus, and Amazon Web Services. Concurrency and Computation: Practice and Experience, 2014
- Cloud-based bioinformatics workflow platform for large-scale next-generation sequencing analyses. Journal of biomedical informatics, 2014
- A cloud-based image analysis gateway for traumatic brain injury research. In 2014 9th Gateway Computing Environments Workshop, 2014
- PDACS-A Portal for Data Analysis Services for Cosmological Simulations. In 2014 9th Gateway Computing Environments Workshop, 2014
2013
- Science as a service: how on-demand computing can accelerate discovery. In Proceedings of the 4th ACM workshop on Scientific cloud computing, 2013
- Experiences in building a next-generation sequencing analysis service using galaxy, globus online and Amazon web service. In Proceedings of the Conference on Extreme Science and Engineering Discovery Environment: Gateway to Discovery, 2013
- Enabling multi-task computation on galaxy-based gateways using swift. In 2013 IEEE International Conference on Cluster Computing (CLUSTER), 2013
- Distributed tools deployment and management for multiple galaxy instances in globus genomics. In Proceedings of the 8th Workshop on Workflows in Support of Large-Scale Science, 2013
- Extending the Galaxy portal with parallel and distributed execution capability. In Proceedings of the Fourth International Workshop on Data Intensive Computing in the Clouds 2013, 2013
2012
- Deploying bioinformatics workflows on clouds with galaxy and globus provision. In 2012 SC Companion: High Performance Computing, Networking Storage and Analysis, 2012
- Utilisation of a thoracic oncology database to capture radiological and pathological images for evaluation of response to chemotherapy in patients with malignant pleural mesothelioma. BMJ open, 2012
2011
- Enabling collaborative research using the biomedical informatics research network (BIRN). Journal of the American Medical Informatics Association, 2011
- Servicemap: Providing map and gps assistance to service composition in bioinformatics. In 2011 IEEE international conference on services computing, 2011
- Toward semantics empowered biomedical web services. In 2011 IEEE International Conference on Web Services, 2011
- Recommend-as-you-go: A novel approach supporting services-oriented scientific workflow reuse. In 2011 IEEE international conference on services computing, 2011
- Improving the efficiency of subset queries on raster images. In Proceedings of the ACM SIGSPATIAL Second International Workshop on High Performance and Distributed Geographic Information Systems, 2011
- The cardiovascular research grid (cvrg) project. Proceedings of the AMIA Summit on Translational Bioinformatics, 2011
- Providing map and GPS assistance to service composition in bioinformatics. In 2011 IEEE International Conference on Services Computing, 2011
2010
- CaGrid Workflow Toolkit: A taverna based workflow tool for cancer grid. BMC bioinformatics, 2010
- A comparison of using Taverna and BPEL in building scientific workflows: the case of caGrid. Concurrency and Computation: Practice and Experience, 2010
2009
- Wrap scientific applications as WSRF grid services using gRAVI. In 2009 IEEE International Conference on Web Services, 2009
- Scientific workflows as services in caGrid: a Taverna and gRAVI approach. In 2009 IEEE International Conference on Web Services, 2009
- GridFTP GUI: An easy and efficient way to transfer data in grid. In International Conference on Networks for Grid Applications, 2009
- DataMover: A Lightweight, Extensible Data Movement Framework for Grid Environments. In PDPTA, 2009
2008
- caGrid 1.0: an enterprise Grid infrastructure for biomedical research. Journal of the American Medical Informatics Association, 2008
- Building scientific workflow with taverna and bpel: A comparative study in cagrid. In International Conference on Service-Oriented Computing, 2008
- e-Science, caGrid, and translational biomedical research. Computer, 2008
- Combining the power of Taverna and caGrid: Scientific workflows that enable web-scale collaboration. IEEE Internet Computing, 2008
- A roadmap for caGrid, an enterprise Grid architecture for biomedical research. Studies in health technology and informatics, 2008
- Build grid enabled scientific workflows using gravi and taverna. In 2008 IEEE Fourth International Conference on eScience, 2008
- Orchestrating cagrid services in taverna. In 2008 IEEE International Conference on Web Services, 2008
- Integrating caGrid and TeraGrid. In Proc. 3rd Ann. TeraGrid Conf, 2008
- Enabling petascale science: Data management, troubleshooting, and scalable science services. In Journal of Physics: Conference Series, 2008
- Build Grid Enabled Scientific Workflows Using gRAVI and Taverna. 2008
- Building Scientific Workflow with Taverna and BPEL: A Comparative Study in caGrid. In 4th International Workshop on Engineering Service-Oriented Applications (WESOA), 2008
2007
- caGrid 1.0: A grid enterprise architecture for cancer research. In AMIA Annual Symposium Proceedings, 2007
2006
- Streamlining Grid operations: definition and deployment of a portal-based user registration service. Journal of Grid Computing, 2006
2004
- Reliable data transport: A critical service for the grid. In Building service based grids workshop, Global Grid Forum, 2004
- An overview of grid file transfer patterns and their implementation in the java cog kit. Neural, Parallel & Scientific Computations, 2004
- Reliable data transport: A critical service for the grid. In Building Service Based Grids Workshop, Global Grid Forum, 2004
2003
- Lessons learned producing an ogsi compliant reliable file transfer service. In Global Grid Forum Workshop on Designing and Implementing Grid Services (held in conjunction with GGF9), 2003
2002
- Reliable file transfer in grid environments. In 27th Annual IEEE Conference on Local Computer Networks, 2002. Proceedings. LCN 2002, 2002
- The CardioVascular Research Grid (CVRG) Project. 2002
- Perspectives on Informatics. 2002
- OGSA-DMI Functional Specification 1.0. 2002
- Research Data management using Globus Online. 2002
- eScience 2025. 2002