MistralAI releases `Magistral`, their first official reasoning models. magistral small 2506 released under apache 2.0 license!

SmokeyDope@lemmy.world · 5 days ago

Nice post Hendrik thanks for sharing your knowledge and helping people out :)

SmokeyDope@lemmy.world · 5 days ago

I once got kobold.CPP working with their collection of TTS model+ wav tokenizer system. Here’s the wiki page on it.

It may not be as natural as a commercial voice model but may be enough to wet your appetite in the event that other solutions feel overwhelmingly complicated

SmokeyDope@lemmy.world · edit-2 5 days ago

Wow this is some awese information Brucethemoose thanks for sharing!

I hope you dont mind if I ask some things. Tool calling is one of those things I’m really curious about. Sorry if this is too much please dont feel pressured you dont need to answer everything or anything at all. Thanks for being here.

I feel like a lot of people including myself only vaguely understand tool calling, how its supposed to work, and simple practice excersises to use it on via scripts and APIs. What’s a dead simple python script someone could cook to tool call within the openai-compatable API?

In your own words what exactly is tool calling and how does an absolute beginner tap into it? Could you clarify what you mean by ‘tool calling being built into their tokenizers’?

Would you mind sharing some sources where we can learn more? I’m sure huggingface has courses but maybe you know some harder to find sources?

Is tabbyAPI an engine similar to ollama, llama.cpp, ect?

What is elx2,3, ect?

SmokeyDope@lemmy.world · 6 days ago

Pangolin.

SmokeyDope@lemmy.world · 7 days ago

Yes it would have been awesome of them to release a bigger one for sure :( At the end of the day they are still a business that needs a product to sell. I don’t want to be ungrateful complaining that they dont give us everything. I expect some day all these companies will eventually clam up and stop releasing models to the public all together once the dust settles and monopolies are integrated. I’m happy to be here in an era where we can look forward to open licence model released every few months.

SmokeyDope@lemmy.world · edit-2 7 days ago

At the end of the day you need to decide what kind of person you are. Are you pragmatic or idealistic? Are you able to separate art from artist and creation from creator? Should you support a good open source service created for the betterment of everyone if you dont like the politics of its developers?

I’m a pragmatist by nature. I believe that a useful tool remains a useful tool even when its crafted by tankie assholes. If I found out the maker of a computer command like sudo was a leninist or whatever I wouldnt go out of my way to install an alternative just because I dont agree with the batshit politics of the creator. Just like I wouldn’t stop enjoying a song after finding out the ones who made it were greedy egotistical dickheads in personal life.

Young and politically charged idealist love that online social justice warrior signalling and political identity posturing. Everythings gotta be us vs them culture war, with us needing to always be on the morally/politically high ground else your a filthy inhuman nazi them who must be refunded/canceled. You get older and realize most people no matter the lean have some level of dogshit half baked politics or some other degree of mental emotional whackiness from past trauma or poor life circumstances causing them to be imperfect animals with dumb fucking biases. That’s humanity and the heart of darkness for ya. You can choose to associate the imperfections of the creator with the creation, or you can try to decouple them and see them as separate entities linked by causality.

SmokeyDope@lemmy.world · 7 days ago

The free API is the gateway drug to selling the product to customers (devs). The average consoomers attention and data is the product to be sold to customers (advertisers and big brother)

SmokeyDope@lemmy.world · 7 days ago

In the comics Thanos was straight up the right man hand/side hoe of the concept of death made manifest the whole point of offing half of all life was just to make mistress death happy (I think she actually ordered it). A lot of the original plot points of the infinity war and its buildup was lost in translation. Movie thanos motivations werent even half baked, he could have doubled resources and made a paradise for life with his motivations

SmokeyDope@lemmy.world · edit-2 7 days ago

MistralAI releases `Magistral`, their first official reasoning models. magistral small 2506 released under apache 2.0 license!

SmokeyDope@lemmy.world · 7 days ago

Ding Ding, check this comment chain for your answer. Today you, tomorrow me.

SmokeyDope@lemmy.world · edit-2 7 days ago

Question one: yes and no. Most of the vomiting emojis shared here in comments are fake made using googles emoji kitchen thing. But there are many real modifiers for emojis like skin color or adding accents like tildes to regular english alphabet characters.

Question two: Modern keyboards typically have most emojis built in for you to select through. I dont think typing in the unicode values will automatically convert on phone operating systems but this might help if using windows or programming into a website.

the XKCD explained article on this actually gave some really great info.

SmokeyDope@lemmy.world · 8 days ago

You can tell because of checks notes “ThE ThIcK LInES AnD Off ColoURinG”

SmokeyDope@lemmy.world · 8 days ago

Devstral was released recently specifically trained for tool calling in mind. I havent personally tried it out yet but people say it works good with vscode+roo

SmokeyDope@lemmy.world · edit-2 8 days ago

Thanks for the input! I do eventually plan on making some scripts and a custom web interface to interact with/expose some local services on my network once I have the basics of HTML covered as part of a portfolio thing so would like to cover my ass early and not have problems later

SmokeyDope@lemmy.world · edit-2 8 days ago

Got any security advice for setting up a locally hosted website/external service?

SmokeyDope@lemmy.world · edit-2 8 days ago

Got any security advice for setting up a locally hosted website/external service?

SmokeyDope@lemmy.world · edit-2 9 days ago

Havent heard of this one before now. It will be interesting to see how it actually performs. I didnt see what license the models will be released under hope its a more permissive one like apache. Their marketing should try cooking up a catchy name thats easy to remember. It seems they’re a native western language company so also hope it doesnt have too much random Chinese characters like qwen does sometimes

Ive never really gotten into MoE models, people say you can get great performance gains with clever partial offloading strategy between various experts. Maybe one of these days!

SmokeyDope@lemmy.world · edit-2 9 days ago

If your running into the issue of an app wanting an api key for your local ollamas openai-compatable web interface API and refuses to work without one, I found that any random characters work. If you port forward your host computer you should be able to access the webui interface on an external network using the public IP.

Heres the dead simple python program I used to send and recieve text to kobold.cpp engine through the web API. Not sure how similar ollama but afaik openai-compatable API means it all should works close to the same for compatibility(I think? lol!) if you give it a shot Make sure to set the .py file you make as executable and run it from a terminal doing ./filename.py to see the output in real time. It should make a log text file in same dir as the program too. Just use your host computers local ip if the python script pc is on same network.

spoiler

import requests

# Configuration
API_URL = "http://10.0.0.xx:5001/api/v1/generate"
PROMPT = "Tell me a short story about a robot learning to dance."
OUTPUT_FILE = "output.txt"

# Define the API request data
data = {
    "prompt": PROMPT,
    "max_length": 200,      # Adjust response length
    "temperature": 0.7,     # Control randomness (0=deterministic, 1=creative)
    "top_p": 0.9,           # Focus on high-probability tokens
}

# Send the request to kobold.cpp
response = requests.post(API_URL, json=data)

if response.status_code == 200:
    # Extract the generated text
    result = response.json()
    generated_text = result["results"][0]["text"]
    
    # Save to a text file
    with open(OUTPUT_FILE, "w") as f:
        f.write(generated_text)
    print(f"Response saved to {OUTPUT_FILE}!")
else:
    print(f"Error: {response.status_code} - {response.text}")

SmokeyDope@lemmy.world · 9 days ago

What is this post even talking about why does your silverware have a pillow

SmokeyDope@lemmy.world · 9 days ago

Just wondering, why do you think that coke users stay away from psychadelics and vice versa?

SmokeyDope@lemmy.world · edit-2 9 days ago

VSCode + roo plugin seems to be all the hotness for coders leveraging ‘agenic teams’ so I spent a bit playing around with it. Most local models dont do tool calling very well I need to see if devstral works better without giving errors. I hear real professionals use claude API for that kind of stuff.

Im only vaguely familiar with getting computers to send, recieve, and manipulate data with eachother on a local network so got a very basic python script going pointed at kobold cpps openai-compatable API to send prompts and recieve repliesinstead of the default webui app just to learn how it works under the hood.

One of my next projects will be creating a extremely simple web based UI for my ereaders basic web browser to connect to. kobold has something similar with the /noscript subpage but even that is too much for my kobo reader. I intend to somehow leverage a gemtext to html proxy like ducking or newswaffle to make the page rendering output dead simple.

One of these days im going to get a pi zero and attach it to a relay and see if I can get a model to send a signal to turn a light on and off. Those home automation people with the smart houses that integrate llms into things look soo cool

SmokeyDope@lemmy.world · edit-2 10 days ago

Updated guidelines for c/LocaLLama (new rules)

SmokeyDope@lemmy.world · edit-2 11 days ago

Plot twist: she knew and didn’t care. “Its a doggish canine social enough to accepts the pets, a couple rabies shots and its good enough for me.”

SmokeyDope@lemmy.world · edit-2 11 days ago

“The Grey” is the first thing to pop into my head.

The stupid meme worthy part is the way wolves are presented as a threat in the movie is so over the top like old childrens folklore level omnipresent coked out superwolves with a 100 mile killing radius stripping the territory all threatening life larger than a squrrel and enough intelligence for tactical strategizing to pick an entire group of men 1 by 1.

The way everything else is executed is what turns it around. The cinematography and the emotional human story of the main character guys motivations and interactions with the res5 of the group is fun. It makes it a good watch to spend an hour or two of your life on. The cast has some bangers and the acting is great.

SmokeyDope@lemmy.world · edit-2 20 days ago

DeepSeek just released updated r1 models with 'deeper and more complex reasoning patterns'. Includes a r1 distilled qwen3 8b model boasting "10% improved performance" over original

SmokeyDope@lemmy.world · edit-2 21 days ago

Advice for picking a PSU for server class GPUs? Also a question about adapter cable

SmokeyDope@lemmy.world · edit-2 22 days ago

Using local model with basic RAG to help reference rules when playing table top game

SmokeyDope@lemmy.world · 1 month ago

Anthropic's 'On the Biology of a LLM' got a massive update: Features fascinating deep dives into how models process information behind the scenes

SmokeyDope@lemmy.world · edit-2 3 months ago

YSK theres a open source tool to cleanly read webpage articles called 'NewsWaffle'

SmokeyDope

Moderates