• Zos_Kia@jlai.lu
    link
    fedilink
    English
    arrow-up
    2
    ·
    17 hours ago

    Yes i saw that benchmark and was honestly not surprised with the results. It seems that Anthropic really focused on those issues above and beyond what was done in other labs.

    • probably2high@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      4 hours ago

      With its prior government contact, maybe anthropic was tuning it to ward against all the fucking dolts in decision-making roles.