Hi, I'm Jorge Guerra
ML Engineer · Data Scientist · Entrepreneur · Builder
I've spent more than 10 years building ML systems, data platforms, and products that solve real problems, from saving banks millions with predictive models to deploying cardiovascular risk tools used by 500K+ people. My work spans machine learning, NLP, full-stack development, and mobile apps across banking, healthcare, research, and tech startups.
I'm equally comfortable building frontend interfaces, designing backend architectures, writing Python pipelines, or automating complex workflows end-to-end.
What I'm Building Now
| Project | Role | Description |
|---|---|---|
| Onkopilot | Co-Founder & CTO | AI-powered oncology support platform using RAG & LLMs to guide cancer patients and caregivers |
| Xsporty | Co-Founder & CTO | Sports community app connecting athletes, coaches, and facilities in Puerto Rico |
| My Dental Path | CTO | Platform helping international dentists navigate US dental school admissions |
Experience Highlights
Data Scientist Program Lead @ Popular Bank (2023 – Present)
- Saved $58M by building a tool to classify claimed vs. unclaimed properties
- Saved $300K/year with an ML-based loan portfolio valuation model
- Reduced AML analyst workload by 10X with an XGBoost-powered alert detection system
Data Scientist @ One Drop (2021 – 2022)
- Developed a cardiovascular risk model deployed to 500K+ users with diabetes
Senior Data Scientist @ KPMG (2019 – 2021)
- Built an NLP + OCR pipeline automating 10,000+ manual medical record reviews
Data Scientist @ Children's Hospital of Philadelphia (2017 – 2019)
- Won the Drexel LeBow Analytics 50 Award for a patient no-show prediction model
- Published at ACL CLPsych 2019 and IEEE EMBS 2018
Skills & Stack
Languages: Python · SQL · R · C# · MATLAB · Java
ML / AI: Scikit-learn · XGBoost · LightGBM · TensorFlow · PyTorch · SVMs · HMMs · Ensemble Methods
NLP: spaCy · NLTK · BioBERT · ClinicalBERT · Transformers · NER · Topic Modeling
GenAI: LLMs · RAG · LangChain · OpenAI API · Claude · Prompt Engineering · Fine-tuning
Cloud & Infra: AWS · Azure · Databricks · Docker · Git · CI/CD
Mobile & Web: Flutter · Dart · Firebase · Next.js · iOS · Android
Visualization & BI: Power BI · Tableau · Plotly · Matplotlib
Selected Publications
- Cardiovascular Risk Prediction for Mobile Health Applications – Intelligence-Based Medicine, 2025
- CV Disease Risk Variability Over Time in People with Diabetes – AHA Scientific Sessions, 2021
- CLPsych 2019: Predicting Suicide Risk from Reddit Posts – ACL Workshop, 2019
- SCOSY: A Biomedical Collaboration Recommendation System – IEEE EMBS, 2018
- Upper Extremity Movement Classification for Stroke Rehab – IEEE ICORR, 2017
Education & Honors
M.S. Data Science – Columbia University (GEM Fellow)
B.S. Computer Engineering – University of Central Florida (McNair Scholar · Dean's List)
Analytics 50 Award – Analytics Magazine, 2018
GEM Fellowship – National GEM Consortium, 2014
McNair Scholar – Ronald E. McNair Program, 2013
Based in San Juan, Puerto Rico · Open to Consulting
If you have a project that involves ML, NLP, data platforms, or full-stack product development, let's talk.