A new report from plagiarism detector Copyleaks found that 60% of OpenAI’s GPT-3.5 outputs contained some form of plagiarism.

Why it matters: Content creators from authors and songwriters to The New York Times are arguing in court that generative AI trained on copyrighted material ends up spitting out exact copies.

  • FaceDeer@kbin.social
    link
    fedilink
    arrow-up
    37
    arrow-down
    5
    ·
    8 months ago

    Not to mention that a response “containing” plagiarism is a pretty poorly defined criterion. The system being used here is proprietary so we don’t even know how it works.

    I went and looked at how low theater and such were and it’s dramatic:

    The lowest similarity scores appeared in theater (0.9%), humanities (2.8%) and English language (5.4%).