New York City’s healthcare system generates vast amounts of medical data daily, but much of it remains siloed and underutilized — slowing research, decision-making, and patient care.
What if AI could change that?
Anurag Patil, a senior data engineer and machine learning expert at a leading healthcare system in New York City, is doing just that. By unifying and structuring healthcare data and priming it for use in AI models, he’s working to bridge the gap between fragmented healthcare data and meaningful, real-world impact. He’s not just improving healthcare data pipelines — he’s creating scalable, efficient, and equitable artificial intelligence solutions that could redefine how medical systems operate worldwide.
In a city like New York — where healthcare institutions serve millions of patients from vastly different backgrounds — this work is not just technical. It’s shaping the future of AI-driven medicine to be truly inclusive, accessible, and transparent.
The Challenge: Data Fragmentation Is Holding Healthcare Back
The U.S. healthcare system generates more than 2.5 quintillion bytes of data daily, a vast amount of information that has the power to unlock meaningful insights in research, diagnosis, and efficiency. But unfortunately, much of it is siloed — locked away in isolated, incompatible systems.
For researchers, this means critical insights are buried in unstructured datasets, delaying breakthroughs in disease detection and treatment. For doctors, it means that patient histories are often incomplete or scattered across different platforms, leading to inefficiencies, misdiagnoses, and suboptimal treatment plans.
New York City represents both the greatest challenge and the greatest opportunity for solving this issue. With a diverse population and large number of hospitals in its healthcare system, integrating data across NYC institutions could set a precedent for the rest of the country.
“If AI-driven healthcare can work in New York, it can work anywhere,” Patil explains.
Patil’s Early Career
Patil graduated with a bachelor’s in engineering from the Birla Institute of Technology and Science (BITS), followed by a master’s in computational science, engineering, and mathematics from UT Austin. During his studies, he specialized in natural language processing, a subset of machine learning that enables computers to interpret and generate human language. If this sounds familiar, it should — it’s the technology that powers tools like ChatGPT.
After graduating, Patil joined a major American electric vehicle manufacturer in Palo Alto, California, as a data engineer. There, he led the development of multiple successful projects, including a pipeline that streamlined the ingestion of R&D data. This drastically reduced data processing time about thirty times over, allowing the company to complete research cycles much faster.
While this work revolutionized R&D, it also sparked a realization in Patil: AI’s greatest impact wasn’t in industry but in human lives. This inspired him to apply his expertise where it mattered most: healthcare.
Building the Backbone of AI in Medicine
Determined to meaningfully channel his expertise, Patil joined one of the largest and most prominent healthcare systems in New York as a senior data engineer in 2023.
There, his work centers on engineering the fundamental infrastructure that allows AI to operate effectively in healthcare.
- Powering Data Unification at Scale: Patil has developed large-scale data integration frameworks that aggregate patient data from diverse sources — including electronic health records (EHR), pathology reports, radiology scans, and EEG readings — into a single, AI-ready system.
- Enabling AI-Driven Insights for Medical Decision-Making: By applying natural language processing and machine learning, Patil fine-tunes AI models that can extract critical information from clinical notes, predict patient risks, and assist in disease diagnosis.
- Automating Data Pipelines: Patil also designed a custom Python module, “datapilot” — working as an advanced automation tool, it processes and standardizes massive biomedical datasets, ensuring accuracy and consistency in AI-driven research.
Thanks to Patil, NYC’s mountain of healthcare data that was previously siloed and incompatible can finally be standardized and used to train powerful AI models, paving the way for groundbreaking developments in healthcare.
AI as a Force for Equitable Healthcare
One of the biggest risks in AI-driven healthcare is bias. Many healthcare AI models are trained on datasets that fail to fully represent the diversity of real-world patients, leading to inaccuracies when applied in different populations.
So for Patil, ensuring that AI-driven healthcare solutions work for everyone — and not just select demographics — is a core priority.
- New York City as a Testing Ground for Equitable AI: With its diverse population and wide socioeconomic range, NYC is the ideal environment to build AI models that are truly representative and fair.
- Breaking Down Barriers to Data Access: By unifying disparate patient records, Patil’s work ensures that historically underserved communities — who often receive fragmented care — can access more comprehensive and timely diagnoses, improving outcomes across demographic groups.
- Transparent and Ethical AI Systems: Patil advocates for AI models that are interpretable and accountable, ensuring that machine-generated medical insights can be trusted by physicians and patients alike.
“AI is not just about automation — it’s about augmenting human expertise in a way that is responsible, fair, and accessible to all,” says Patil.
Scaling AI-Driven Healthcare Beyond NYC
The systems Patil is designing today are not just for one hospital or research center. They are blueprints for AI-driven healthcare at scale.
His vision includes:
- Expanding AI-Driven Diagnostics: Making AI-powered disease detection available across a broader range of hospitals and research institutions.
- Developing AI Models That Adapt in Real-Time: Ensuring that medical AI can continually learn from new patient data while maintaining accuracy and fairness.
- Fostering Global Collaboration: Sharing data engineering best practices with research institutions worldwide to create more universal, ethical AI models for medicine.
If successful, Patil’s approach could serve as a model for AI-driven healthcare in cities across the U.S. and beyond.
AI as the Future of Healthcare — But Only if Done Right
AI is already transforming medicine — but only if it’s built responsibly. In NYC, Patil is proving that AI-driven healthcare isn’t just an innovation; it’s the future of medicine itself. And as he lays the foundation for replicable AI frameworks, he’s setting a precedent for a new era in healthcare analytics and innovation that promises to transform medical care around the world.
