• threelonmusketeers@sh.itjust.works
    1 month ago

    but they seem to average about 20%. This seems like a terrible record of failure for an AI tool that touts its precision.

    That does seem pretty bad.

    To play devil’s advocate for a moment, what systems were they using before implementing the AI tool? Were those systems better? Seems like a low bar to beat…