Task-Specific LLM Evals That Do and Don't Work

Created 6mo | Dec 9, 2024, 4:20:08 PM


Login to add comment

Other posts in this group

Show HN: Nexus.js - Fabric.js for 3D

I was looking for a tiny library to easily transform both 2D & 3D objects with simple mouse / touch controls and a fixed camera, in the browser.

Like a simple 3D editor but without requiring the

Jun 16, 2025, 10:10:10 PM | Hacker news
Show HN: Chawan TUI web browser

A terminal-based web browser in Nim.[1] Has acceptable (YMMV) CSS rendering, some JS support, and inline images (sixel/kitty). It can also use various protocols other than http(s) such as (s)ftp

Jun 16, 2025, 10:10:09 PM | Hacker news