• DrCake@lemmy.world
    link
    fedilink
    English
    arrow-up
    1
    ·
    15 days ago

    So when’s the ruling against OpenAI and the like using the same copyrighted material to train their models

    • irotsoma@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      15 days ago

      But OpenAI not being allowed to use the content for free means they are being prevented from making a profit, whereas the Internet Archive is giving away the stuff for free and taking away the right of the authors to profit. /s

      Disclaimer: this is the argument that OpenAI is using currently, not my opinion.

  • MigratingtoLemmy@lemmy.world
    link
    fedilink
    English
    arrow-up
    1
    ·
    15 days ago

    If OpenAI can get away with going through copy-righted material, then the answer to piracy is simple: round up a bunch of talented Devs from the internet who are writing and training AI models, and let’s make a fantastic model trained on what the internet archive has. Tell you what, let Mistral’s engineers lead that charge, and put an AGPL license on the project so that companies can’t fuck us over.

    I refuse to believe that nobody has thought of this yet

    • bandwidthcrisis@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      15 days ago

      An AI trained on old Internet material would be like a synthetic Grandpa Simpson:

      “In my day we said ‘all your base’ and laughed all day long, because it took all day to download the video.”