So they are mass murderers. Good thing they didn’t rob a convenience store or they’d be in jail for life.
So they are mass murderers. Good thing they didn’t rob a convenience store or they’d be in jail for life.
This is literally just a tokenization artifact. If I asked you how many r’s are in /0x5273/0x7183 you’d be confused too.
However, if they become regulars, I think they eventually find communities they’d rather identify with.
You just ate garlic straight? Never heard of that.
Meh, Neovim has a fzf plugin. I use Logseq for linking. Linking is good because searching can fail to capture all the aliases for something, even if fuzzy
But I’m sure it’ll be useful for someone!
Why not just use text notes and fzf?
Exactly.
We will just put Trump porn on the whitehouse.gov site to show you how little control America has over its digital economy.
Damn I switched to proton last year and am NOT migrating again.
I thought it was based in the EU. Why does he care about the US at all?
Cthulu lives (runs away)
Wait. Protons CEO is conservative?
You can say other things. Good. It’s been better. I’m alive. Just keep it short.
Source? and … what?
Meta? The one that released Llama 3.3? The one that actually publishes its work? What are you talking about?
Why is it so hard to believe that deepseek is just yet another amazing paper in a long line of research done by everyone. Just because it’s Chinese? Everyone will adapt to this amazing innovation and then stagnate and throw compute at it until the next one. That’s how research works.
Not to mention China has billions of more people to establish a research community…
I think “just writing better code” is a lot harder than you think. You actually have to do research first you know? Our universities and companies do research too. But I guarantee using R1 techniques on more compute would follow the scaling law too. It’s not either or.
Well the uncensored fine tuning dataset is oss
Nah, o1 has been out how long? They are already on o3 in the office.
It’s completely normal a year later for someone to copy their work and publish it.
It probably cost them less because they probably just distilled o1 XD. Or might have gotten insider knowledge (but honestly how hard could CoT fine tuning possibly be?)
Yes but also it’s open source soooo
https://huggingface.co/mradermacher/DeepSeek-R1-Distill-Llama-70B-Uncensored-i1-GGUF
Good catch, that’s probably what’s happening then
deleted by creator
Tech unions now