• atzanteol@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    12
    arrow-down
    3
    ·
    5 days ago

    This is just brilliant. Every ridiculous argument addressed perfectly.

    but you have no idea what the code is

    Are you a vibe coding Youtuber? Can you not read code? If so: astute point. Otherwise: what the fuck is wrong with you?

    You’ve always been responsible for what you merge to main. You were five years go. And you are tomorrow, whether or not you use an LLM.

    I want to scream every time somebody brings up “but it writes code that doesn’t work” and all I can think of is “what the fuck is wrong with you that you’re merging code that doesn’t work?” LLMs do not remove your responsibility as a developer to create a working product.

    • SouthFresh@lemmy.world
      link
      fedilink
      English
      arrow-up
      8
      arrow-down
      2
      ·
      5 days ago

      I’ve played with QwenCoder2.5, Qwen3, and Devstral.

      Holy shit are they bad. Seriously, consistently bad at coding. Initialized variables that are never used. Importing, using functions/methods that don’t exist, it’s fucking pathetic.

      • atzanteol@sh.itjust.works
        link
        fedilink
        English
        arrow-up
        6
        arrow-down
        2
        ·
        5 days ago

        I don’t know what to tell ya - GPT 4o does a really good job. Feel free to simply blame “ai slop” for everything though.

        • Endmaker@ani.social
          link
          fedilink
          English
          arrow-up
          1
          ·
          edit-2
          8 hours ago

          Kinda late to the party but based on my day-to-day usage of ChatGPT, 4o is rubbish when it comes to coding.

          Now o4-mini-high on the other hand - that’s the good stuff.

        • ikt@aussie.zoneOP
          link
          fedilink
          English
          arrow-up
          1
          ·
          5 days ago

          yep I’ve been told Gemini is the new hot shit, really hoping local models can catch up

  • hendrik@palaver.p3x.de
    link
    fedilink
    English
    arrow-up
    7
    arrow-down
    1
    ·
    5 days ago

    So should I try the Zed editor? I’ve tried AI assisted coding but never with a fully “immersive” experience. And I have a ton of small little woes, the code is riddled with small little annoyances and bugs and I end up rephrasing and doing several tries until I arrive at something which I still need to refactor for an hour or so… So does this apply to people who need to uphold some level of quality, and people who can’t just change the programming language of an entire existing project so it works better with AI?

    • SGforce@lemmy.ca
      link
      fedilink
      English
      arrow-up
      3
      arrow-down
      1
      ·
      edit-2
      5 days ago

      Local models are not capable of coding yet, despite what benchmarks say. Even if they get what you’re trying to do they spew out so many syntax errors and tool calling problems that it’s a complete waste of time. But if you’re using an API then I don’t see why not one editor over another. They’ll be different in implementation but generally pull off the same things

      • brucethemoose@lemmy.world
        link
        fedilink
        English
        arrow-up
        8
        ·
        edit-2
        5 days ago

        Local models are not capable of coding yet, despite what benchmarks say. Even if they get what you’re trying to do they spew out so many syntax errors and tool calling problems that it’s a complete waste of time.

        I disagree with this. Qwen Coder 32B and on have been fantastic for niches with the right settings.

        If you apply a grammar template and/or start/fill in their response, drop the temperature a ton, and keep the actual outputs short, it’s like night and day vs ‘regular’ chatbot usage.

        TBH one of the biggest problems with LLM is that they’re treated as chatbot genies with all sorts of performance-degrading workarounds, not tools to fill in little bits of text (which is what language models were originally concieved for).

      • hendrik@palaver.p3x.de
        link
        fedilink
        English
        arrow-up
        2
        arrow-down
        1
        ·
        edit-2
        5 days ago

        Alright. I mean I haven’t used local models for coding. This was ChatGPT, AIstudio and Grok I tried. I can’t try Claude, since they want my phone number and I’m not going to provide that to them. I feel DeepSeek and a few other local models should be able to get somewhere in the realm of commercial services, though. At least judging by the coding benchmarks, we have some open-weight competition there.

    • ikt@aussie.zoneOP
      link
      fedilink
      English
      arrow-up
      3
      ·
      edit-2
      5 days ago

      it’s weird because the post has a massive amount of downvotes in an ai friendly sub, even the hackernews rss bot is being downvoted!

      I think the “cross-posted to” feature is being abused by anti ai zealots

      • brucethemoose@lemmy.world
        link
        fedilink
        English
        arrow-up
        3
        ·
        edit-2
        5 days ago

        It’s not bots, it just how local ML posts are on the internet.

        I got banned from a Reddit fandom sub for the mere suggestion that a certain fan ‘remaster’ be updated with newer diffusion/GAN models. Apparently they weren’t aware the original was made with Waifu2x… But unfortunately, anything tangential to tech bro AI is radioactive.

        • ikt@aussie.zoneOP
          link
          fedilink
          English
          arrow-up
          3
          ·
          edit-2
          5 days ago

          sorry i mean the rss bot:

          https://lemmy.bestiver.se/post/419818

          this was posted on technology@lemmy.world who are super anti ai so it’s no surprise they’re downvoting everything

          Next time might do an archive link and put the real article in the body

          edit: just swapped the links around, will see how this goes

          • brucethemoose@lemmy.world
            link
            fedilink
            English
            arrow-up
            2
            ·
            5 days ago

            Eh, I still bet it was really people browsing /new who downvoted it.

            Honestly I get it, with how enshittified corporate portals/use is already, but still.