lain,
@lain@lain.com avatar

this is the kind of performance you can expect from a current top performing local llm model, i think they have surpassed gpt 3.5 by now.

epictittus,
@epictittus@poa.st avatar

@lain Is it possible to train them not to qualify every answer. Like just delete the last few sentences bro.

lain,
@lain@lain.com avatar

@epictittus yeah, that's just the default 'persona'

snacks,
@snacks@netzsphaere.xyz avatar

deleted_by_author

  • Loading...
  • lain,
    @lain@lain.com avatar

    @snacks i think it's close. it's a lot more useful for anything that involves wrongthink, gpt-4 is so heavily censored, llama3 not so much

    sun,
    @sun@shitposter.world avatar

    @lain @snacks is there any way to remove the shit from models that makes it say "it's important to remember that"

    lain,
    @lain@lain.com avatar

    @sun @snacks yeah, it can be trained out, or you can add a different system prompt, or add the beginning of the reply yourself with something like "sure, here's the answer"

    sun,
    @sun@shitposter.world avatar

    @lain @snacks I still need to learn how to fine tune models. I spent a day trying and failed, but I was using too big of a model for the unquantized or whatever its called to be usable

    lain,
    @lain@lain.com avatar

    @sun @snacks check out unsloth

    arcana,
    @arcana@layer02.net avatar

    @sun @lain @snacks Remember when there was that history AI that let you speak to Hitler and he told you how deeply sorry he was and how it was important to remember the death toll of the holocaust

    sun,
    @sun@shitposter.world avatar

    @arcana @lain @snacks I made a neo nazi character to try to experiment with "deradicalization" and it was impossible to get the AI to fully commit to it, periodically he would say things like how much diversity mattered despite his leanings toward national socialism. still more work to be done.

    arcana,
    @arcana@layer02.net avatar
    lain,
    @lain@lain.com avatar
    lain,
    @lain@lain.com avatar

    @sun @arcana @snacks there are often two different version of each model, a normal one and an 'instruct' one, the instruct one is just the normal one trained with a specific question/answer format, and that's the part where it gets censored / locked down.

    kaia,
    @kaia@brotka.st avatar

    @sun @lain @snacks
    GPT 4o with "how would you like ChatGPT to respond" set to "keep the responses brief and to the point, avoiding additional explanations that contextualize or add a relativization"

    sun,
    @sun@shitposter.world avatar

    @kaia @lain @snacks I mean like I don't even want it in the model so I don't have to eat up prompt space telling it to shut up

  • All
  • Subscribed
  • Moderated
  • Favorites
  • random
  • DreamBathrooms
  • everett
  • InstantRegret
  • magazineikmin
  • thenastyranch
  • rosin
  • GTA5RPClips
  • Durango
  • Youngstown
  • slotface
  • khanakhh
  • kavyap
  • ngwrru68w68
  • tacticalgear
  • JUstTest
  • osvaldo12
  • tester
  • cubers
  • cisconetworking
  • mdbf
  • ethstaker
  • modclub
  • Leos
  • anitta
  • normalnudes
  • megavids
  • provamag3
  • lostlight
  • All magazines