Large language models often know when they are being evaluated

Article URL: https://arxiv.org/abs/2505.23836

Comments URL: https://news.ycombinator.com/item?id=44280113

Points: 30

# Comments: 33

https://arxiv.org/abs/2505.23836

Vytvorené 20d | 15. 6. 2025, 4:30:03

Ak chcete pridať komentár, prihláste sa

Ostatné príspevky v tejto skupine

Bcachefs may be headed out of the kernel

Bcachefs may be headed out of the kernel

Article URL: https://lwn.net/Articles/1027289/

Comments URL: https://news.ycombinator

4. 7. 2025, 18:50:16 | Hacker news

EverQuest

Article URL: https://www.filfre.net/2025/07/everquest/

Comments URL: https://

4. 7. 2025, 18:50:14 | Hacker news

Gremllm

Article URL: https://github.com/awwaiid/gremllm

Comments URL: https://news.ycombinat

4. 7. 2025, 18:50:12 | Hacker news

How to Incapacitate Google Tag Manager and Why You Should (2022)

How to Incapacitate Google Tag Manager and Why You Should (2022)

Article URL: https://backlit.neocities.org/incapacitate-google-tag-manager

Comments URL:

4. 7. 2025, 18:50:11 | Hacker news

LLMs caused drastic vocabulary shift in biomedical publications

LLMs caused drastic vocabulary shift in biomedical publications

Article URL: https://www.science.org/doi/10.1126/sciadv.adt3813

Comments URL:

4. 7. 2025, 18:50:11 | Hacker news

ChatGPT creates phisher's paradise by serving the wrong URLs for major companies

ChatGPT creates phisher's paradise by serving the wrong URLs for major companies

Article URL: https://www.theregister.com/2025/07/03/ai_phishing_websites/

Comments URL:

4. 7. 2025, 18:50:10 | Hacker news

Rust and WASM for Form Validation

Rust and WASM for Form Validation

Article URL: https://sebastian.lauwe.rs/blog/rust-wasm-form-validation/

Comments URL:

4. 7. 2025, 16:40:27 | Hacker news

Techie