Phind is now using V7 of their model on their own platform, as they have found that people overall prefer its output to GPT4’s. This is extremely impressive because it’s not just a benchmark that can be gamed, but crowdsourced opinion on real tasks.

The one area where everything still lags behind GPT4 is question comprehension, but this is a huge accomplishment all the same.

Blog post: https://www.phind.com/blog/phind-model-beats-gpt4-fast

Note: they’ve only openly released V2 of their model. Hopefully they release newer versions soon… I would love to play with them outside their sandbox.

  • Marians@sh.itjust.works · 1 upvote · 1 year ago

    This is one of the most plausible claims to date, because it is supported by anecdotal data from actual use rather than only by gamed benchmarks.

    • noneabove1182@sh.itjust.works (OP, M) · 3 upvotes · 1 year ago

      Another what? Another claim to be better than GPT4? If so, I think this might be one of the most reasonable times it’s been claimed, with evidence, albeit anecdotal, from real use cases instead of just gaming a benchmark.

      • Anony Moose@lemmy.ca · 2 upvotes · 1 year ago

        It’s kind of surprising that none of the big tech companies have managed to beat GPT4 yet. Let’s hope Gemini finally gives it some proper competition.

        • noneabove1182@sh.itjust.works (OP, M) · 1 upvote · 1 year ago

          I almost wonder if they have, but they’re holding back until they have something more game-changing. Let’s be honest: if Gemini releases and says “we’re better than GPT4”, people won’t flock to it. They need a standout feature to make people want to switch.

          • Anony Moose@lemmy.ca · 3 upvotes · 1 year ago

            Honestly, if Google’s claims about Gemini are anywhere near true and if its performance is anywhere near GPT4’s, then the AI model world will be shaken up. Since Demis Hassabis and team from DeepMind are working on Gemini, I expect some really incredible things from it.