Show HN: Buckaroo – Data table UI for Notebooks

Buckaroo is my open source project. It is a dataframe viewer that has the basic features we expect in a modern table - scroll, search, sort. In addition there are summary stats, and histograms available. Buckaroo support Pandas and Polars dataframes and works on Jupter, Marimo, VSCode and Google Colab notebooks. All of this is extensible. I think of Buckaroo as a framework for building table UIs, and an initial data exploration app built on top of that framework. AG-Grid is used for the core table display and it has been customized with a declarative layer so you don't have to pass JS functions around for customizations. On the python side there is a framework for adding summary stats (with a small DAG for dependencies). There is also an entire Low Code UI for point and click selection of common commands (drop column). The lowcode UI also generates a python function that accomplishes the same tasks. This is built on top of JLisp - a small lisp interpreter that reads JSON flavored lisp.

Auto Cleaning looks at columns and heuristically suggests common cleaning operations. The operations are added to the lowcode UI where they can be edited. Multiple cleaning strategies can be applied and the best fit retained. Autocleaning without a UI and multiple strategies is very opaque. Since this runs heuristically (not with an LLM), it’s fast and data stays local.

I'm eager to hear feedback from data scientists and other users of dataframes/notebooks.


Comments URL: https://news.ycombinator.com/item?id=44022265

Points: 30

# Comments: 3

https://github.com/paddymul/buckaroo

Vytvořeno 3h | 18. 5. 2025 18:10:13


Chcete-li přidat komentář, přihlaste se

Ostatní příspěvky v této skupině

Show HN: Vaev – A browser engine built from scratch (It renders google.com)

We’ve been working on Vaev, a minimal web browser engine built from scratch. It supports HTML/XHTML, the CSS cascade, @page rules for pagination, and print-to-PDF rendering. It even handles calc()

18. 5. 2025 20:30:03 | Hacker news
Show HN: Stack Error – ergonomic error handling for Rust

Stack Error reduces the up-front cost of designing an error handling solution for your project, so that you focus on writing great libraries and applications.

Stack Error has three goals:

1. P

18. 5. 2025 20:30:02 | Hacker news
Show HN: Model2vec-Rs – Fast Static Text Embeddings in Rust

Hey HN! We’ve just open-sourced model2vec-rs, a Rust crate for loading and running Model2Vec static embedding models with zero Python dependency. This allows you to embed text at (very) high throu

18. 5. 2025 18:10:16 | Hacker news