New Java Data Science Libraries 2026
last commit 1 week ago apache/zeppelin 6K +2
added 8 months ago
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents.
last commit 3 weeks ago jtablesaw/tablesaw 3K +1
added 11 months ago
Tablesaw is a dataframe and visualization library that supports loading, cleaning, transforming, filtering, and summarizing data.
Java DataFrame Libraries Data Science Libraries AI / ML Libraries
last commit 1 day ago openrefine/openrefine 11K +10
added 12 months ago
OpenRefine is a powerful free, open source tool for working with messy data: cleaning it; transforming it from one format into another; and extending it with web services and external data.
last commit 2 weeks ago dflib/dflib 312
added 1 year ago
A lightweight pure Java implementation of a common DataFrame data structure.
Java Data Structures Data Science Libraries DataFrame Libraries
last commit 19 hours ago haifengl/smile 6K +9
added 1 year ago
Smile is a fast and comprehensive machine learning engine.
last commit 1 week ago apache/systemds 1K +1
added 1 year ago
An open source ML system for the end-to-end data science lifecycle
Java Data Platforms Data Science Libraries AI / ML Libraries
last commit 2 days ago apache/commons-statistics 64 +1
added 1 year ago
The Apache Commons Statistics project provides tools for statistics.
Java Math Libraries Statistics Libraries Data Science Libraries