It looks like AI has followed Crypto chip wise in going CPU > GPU > ASIC

GPUs, while dominant in training large models, are often too power-hungry and costly for efficient inference at scale. This is opening new opportunities for specialized inference hardware, a market where startups like Untether AI were early pioneers.

In April, then-CEO Chris Walker had highlighted rising demand for Untether’s chips as enterprises sought alternatives to high-power GPUs. “There’s a strong appetite for processors that don’t consume as much energy as Nvidia’s energy-hungry GPUs that are pushing racks to 120 kilowatts,” Walker told CRN. Walker left Untether AI in May.

Hopefully the training part of AI goes to ASIC’s to reduce costs and energy use but GPU’s continue to improve inference and increase VRAM sizes to the point that AI requires nothing special to run it locally

    • brucethemoose@lemmy.world
      link
      fedilink
      English
      arrow-up
      3
      ·
      edit-2
      12 days ago
      • Tinygrad is (so far) software only, ostensibly sort of a lightweight PyTorch replacement.

      • Tinygrad is (so far) not really used for much, not even research or tinkering.

      Between that and the lead dev’s YouTube antics, it kinda seems like hot air to me.

        • brucethemoose@lemmy.world
          link
          fedilink
          English
          arrow-up
          4
          ·
          edit-2
          12 days ago

          That’s a premade 8x 7900 XTX PC. All standard and off the shelf.

          I dunno anything about Geohot, all I know is people have been telling me how cool Tinygrad is for years with seemingly nothing to show for it other than social media hype, while other, sometimes newer, PyTorch alternatives like TVM, GGML, the MLIR efforts and such are running real workloads.