Hey HN,
I’ve built Fahmatrix, a minimal, fast Java library for working with tabular data — inspired by Python’s pandas, but designed for performance and simplicity on the JVM.
After working extensively with Python’s data stack, I often ran into limitations related to speed, especially in larger or long-running data workflows. So I built Fahmatrix from scratch to offer similar APIs for manipulating CSVs, performing summary statistics, slicing rows/columns, and more — but all in Java.
Features:
Lightweight and dependency-free
CSV/TSV import with auto-headers
Series/DataFrame structures (like pandas)
describe(), mean(), stdDev(), percentile() and more
Fast parallel operations on numeric columns
Java 17+ support
Docs: https://moustafa-nasr.github.io/Fahmatrix/ GitHub: https://github.com/moustafa-nasr/fahmatrix
I’d love feedback from the Java and data communities — especially if you’ve ever wanted a simple dataframe utility in Java without needing full-scale ML libraries.
Happy to answer any questions!
Comments URL: https://news.ycombinator.com/item?id=44012074
Points: 4
# Comments: 0
Chcete-li přidat komentář, přihlaste se
Ostatní příspěvky v této skupině
Article URL: https://paulgraham.com/field.html
Comments URL: https://news.ycombinator
Article URL: https://vlmsarebiased.github.io/
Comments URL: https://news.ycombinator.c
Article URL: https://retrocomputing.stackexchange.co

Article URL: https://go.dev/blog/error-syntax
Comments URL: https://news.ycombinator.c

I kept rewriting the same array.slice(i, i + n) logic in every project. Eventually, I decided to stop and just make a utility.
Chonkify is a small (~870B), zero-dependency JS/TS library for chun