Y
Hacker News
new | ask | show | jobs
A Long-Tail Professional Forum-Based Benchmark for LLM Evaluation
1 points by wslh 3 hours ago | 0 comments