Claude Opus 4.5, and why evaluating new LLMs is increasingly difficult (simonwillison.net)
5 points by jonesn11 5 hours ago | 1 comment
1515 points by jonesn11 5 hours ago | 1 comment
1511 points by CrazyCompiler01 5 hours ago | 1 comment
1523 points by elliot_a 5 hours ago | 0 comments
1535 points by nailer 5 hours ago | 3 comments
1541 points by transpute 5 hours ago | 0 comments
1553 points by transpute 5 hours ago | 0 comments
1562 points by decimalenough 5 hours ago | 3 comments
1572 points by benodiwal 5 hours ago | 0 comments
1584 points by Aissen 5 hours ago | 0 comments
1591 points by Lunar5227 5 hours ago | 0 comments
1601 points by Brajeshwar 5 hours ago | 0 comments
1611 points by mnbbrown 5 hours ago | 0 comments
1622 points by doener 5 hours ago | 0 comments
1632 points by Brajeshwar 5 hours ago | 0 comments
1645 points by blutoot 5 hours ago | 9 comments
1651 points by falcon_ 5 hours ago | 0 comments
1662 points by lu794377 6 hours ago | 0 comments
1674 points by robertomisuraca 6 hours ago | 5 comments
1682 points by faebi 6 hours ago | 2 comments
1691 points by thwg 6 hours ago | 1 comment
1702 points by cubefox 6 hours ago | 0 comments
1711 points by Mikecraft 6 hours ago | 1 comment
1723 points by stagas 6 hours ago | 0 comments
1738 points by zerosizedweasle 6 hours ago | 1 comment
1742 points by T-A 6 hours ago | 0 comments
1751 points by Nimer 6 hours ago | 1 comment
1763 points by simonebrunozzi 6 hours ago | 0 comments
1772 points by marksaver 6 hours ago | 0 comments
1784 points by I_Nidhi 6 hours ago | 0 comments
1793 points by jesprenj 6 hours ago | 2 comments
180