This is again a big win on the red team at least for me. They developed a “fully open” 3B parameters model family trained from scratch on AMD Instinct™ MI300X GPUs.

AMD is excited to announce Instella, a family of fully open state-of-the-art 3-billion-parameter language models (LMs) […]. Instella models outperform existing fully open models of similar sizes and achieve competitive performance compared to state-of-the-art open-weight models such as Llama-3.2-3B, Gemma-2-2B, and Qwen-2.5-3B […].

As shown in this image (https://rocm.blogs.amd.com/_images/scaling_perf_instruct.png) this model outperforms current other “fully open” models, coming next to open weight only models.

A step further, thank you AMD.

PS : not doing AMD propaganda but thanks them to help and contribute to the Open Source World.

  • Ulrich@feddit.org
    link
    fedilink
    English
    arrow-up
    1
    arrow-down
    10
    ·
    edit-2
    3 hours ago

    I don’t know why open sourcing malicious software is worthy of praise but okay.

      • Ulrich@feddit.org
        link
        fedilink
        English
        arrow-up
        1
        arrow-down
        5
        ·
        1 hour ago

        What’s malicious about AI and LLMs? Have you been living under a rock?

        At best it is useless, and at worst it is detrimental to society.

        • ZeroOne@lemmy.world
          link
          fedilink
          English
          arrow-up
          2
          ·
          55 minutes ago

          So in a nutshell, it’s malicious because you said so

          Ok gotcha Mr/Ms/Mrs TechnoBigot

        • Domi@lemmy.secnd.me
          link
          fedilink
          English
          arrow-up
          3
          ·
          1 hour ago

          I disagree, LLMs have been very helpful for me and I do not see how an open source AI model trained with open source datasets is detrimental to society.