Liquid AI reveals 8B-A1B MoE trained on 38T

asb · 2026-05-30T15:12:04 1780153924

Beware the license. They misleadingly state on the blog post "Open-weight — Download, fine-tune, and deploy without restrictions". But if you read their license <https://huggingface.co/LiquidAI/LFM2.5-8B-A1B/blob/main/LICE...> it has significant restrictions for any org with over $10M in revenue.

onlyrealcuzzo · 2026-05-29T22:48:43 1780094923

I just tested this on a bug fixing benchmark I'm working on.

It did not perform as well as I expected. Qwen2.5-Coder-3B (2 years old) outperformed it by a wide margin, fixing ~50% of bugs whereas this model only fixed ~12%.

Granted, it's not a coder-specific model, but given its benchmark performance relative to Gemma models, and that it's two years newer, and that it's an MoE with 8B total params, I expected it to be more competitive.

walrus01 · 2026-05-30T01:40:03 1780105203

I personally find any model smaller than something like Qwen 3.6 35B-A3B (8-bit quantization, about 49GB memory usage when loaded into llama.cpp) to be too "stupid" for reliable use.

I would much rather not run the model on my local laptop hardware and offload that to some system sitting under my desk in my home office, accessible via VPN, than take the risk of using an unreliable and flaky tool for the convenience of having it on the same hardware on my lap.

I pay very little attention to 8 billion or whatever (or even much smaller) models these days and I don't feel like I'm missing much.
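As a rough check on the "about 49GB" figure above, here is a minimal sketch of a weight-only memory estimate. It assumes llama.cpp's Q8_0 layout (blocks of 32 int8 weights plus one 16-bit scale, so 8.5 bits per weight) and takes "35B" at face value; `weight_gb` is a hypothetical helper, not part of any library.

```python
# Weight-only memory estimate for a quantized model (a sketch, not a measurement).
# Assumption: Q8_0 stores 32 int8 weights plus one 16-bit scale per block,
# i.e. 34 bytes per 32 weights = 8.5 bits per weight.

def weight_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate weight storage in GB (10^9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9

Q8_0_BPW = 34 * 8 / 32   # 8.5 bits per weight
TOTAL_PARAMS = 35e9      # "35B" taken at face value (assumption)

print(f"weights only: {weight_gb(TOTAL_PARAMS, Q8_0_BPW):.1f} GB")
```

The gap between this weight-only number and what the process actually uses would presumably be KV cache, compute buffers, and any extra files loaded alongside the model; those depend on context length and settings, so they are left out here.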

satvikpendem · 2026-05-30T02:31:05 1780108265

Qwen 3.6 27B dense is much better than the 35B MoE model for coding, not sure if you've tried that yet.

walrus01 · 2026-05-30T02:33:37 1780108417

yes, I have, I use both. 27B slower in tok/s due to density, obviously, 35B-A3B for speed on simpler tasks.

intothemild · 2026-05-30T09:59:49 1780135189

You should enable MTP now that it's available.

LLamaCPP has had some massive updates in the last week or so.

npodbielski · 2026-05-30T14:05:01 1780149901

Yes, Qwen 3.6 MoE is hitting like 80-90t/s on Strix Halo. On the R9700 I had like 170t/s. It was not possible to keep up. But the MoE gets stuck going in circles very often. I then switch to the dense model and get 20-30t/s, but it is able to solve quite a lot of tasks.
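One common back-of-the-envelope explanation for the MoE vs dense speed gap is that token generation is mostly memory-bandwidth bound: each new token has to read roughly the active weights once. A minimal sketch under that assumption follows; the bandwidth and bits-per-weight values are hypothetical placeholders, and real throughput will be lower because of KV cache reads, routing overhead, and compute.

```python
# Upper-bound decode speed if generation is purely memory-bandwidth bound.
# Assumptions (all hypothetical): every active weight is read once per token,
# nothing else is read, and the bandwidth figure is fully achievable.

def decode_tps_upper_bound(active_params: float,
                           bits_per_weight: float,
                           bandwidth_gb_s: float) -> float:
    """Tokens per second = bytes/s available divided by bytes read per token."""
    bytes_per_token = active_params * bits_per_weight / 8
    return bandwidth_gb_s * 1e9 / bytes_per_token

BANDWIDTH_GB_S = 256.0   # placeholder, substitute your hardware's figure
BPW = 4.5                # placeholder average for a 4-bit quant

moe = decode_tps_upper_bound(3e9, BPW, BANDWIDTH_GB_S)     # 35B-A3B: ~3B active
dense = decode_tps_upper_bound(27e9, BPW, BANDWIDTH_GB_S)  # 27B dense: all active

print(f"MoE bound:   {moe:.0f} tok/s")
print(f"dense bound: {dense:.0f} tok/s")
print(f"ratio:       {moe / dense:.1f}x")
```

Under this model the ratio depends only on the active parameter counts (27B / 3B), not on the bandwidth or quantization chosen, which is why the dense model feels so much slower on the same machine.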

alfiedotwtf · 2026-05-30T17:45:38 1780163138

For those speeds, I’m assuming Q4?

npodbielski · 2026-05-31T06:05:05 1780207505

Ud_Q4_k_xl

intothemild · 2026-05-30T14:32:00 1780151520

I get 50-60t/s tg on my r9700 with the dense, unsloth MTP quant UD-Q5_K_XL, K@8/V@4 256k context.

Using Vulkan backend.

```
llama-server -fa on -t 7 -ngl 999 --mlock --fit off --kv-offload --no-webui --metrics \
  --chat-template-kwargs '{"preserve_thinking": true}' \
  -b 2048 -ub 1024 \
  -m /mnt/models/unsloth/Qwen3.6-27B-MTP-GGUF/Qwen3.6-27B-UD-Q5_K_XL.gguf \
  --mmproj /mnt/models/unsloth/Qwen3.6-27B-MTP-GGUF/mmproj-F16.gguf \
  -c 262144 --kv-unified -ctk q8_0 -ctv q4_0 \
  --spec-type draft-mtp --spec-draft-n-max 3 --spec-draft-ngl 99 \
  --alias unsloth/Qwen3.6-27B-MTP-GGUF \
  --temp 0.60 --top-k 20 --top-p 0.95 --min-p 0.00 --presence-penalty 0.00 --repeat-penalty 1.00
```

sheeshkebab · 2026-05-30T15:40:21 1780155621

27b is slow as molasses vs 35b on local stuff I have (m5 max). Mtp doesn’t make any difference either.

theanonymousone · 2026-05-30T06:35:02 1780122902

Have you seen the 8bit quantisation matter a lot? The "consensus" in r/LocalLlama is that up to 4 bits the loss is tolerable.

walrus01 · 2026-05-30T06:55:50 1780124150

Absolutely. Difference in Q6 vs Q8 is not as immediately noticeable, but if I test by starting from a blank slate context and giving it the same complicated task with Q4 vs a Q8 GGUF file loaded, the difference is apparent. The Q4 will struggle or do 'stupid' things with even simple bash or python. Q4 might not be as noticeable for conversational purely text one on one interaction with an LLM, but when you dig deeper into something that's more esoteric in a training dataset than a chat conversation, absolutely a big gap there.

I think some of the folks in the local llm social media communities are using them for things like company-hosted customer service chat bots, or purely english text writing stuff where Q4 will probably not cause a problem. For more discrete technical work I stick pretty much exclusively to Q8.

theanonymousone · 2026-05-30T09:45:49 1780134349

Thanks a lot. How about Q8 vs FP16/BF16? Have you checked them too?

walrus01 · 2026-05-30T21:32:38 1780176758

I have not spent a lot of time running FP16 'full precision' versions of some things, but as the other commenter says, it's not much difference. There's a really wide array of benchmarks and tests from a lot of third parties unrelated to the trainer of the AI models that shows at most a two percent difference in score and capability between BF16 and Q8.

bradfa · 2026-05-30T12:35:33 1780144533

Q8 quant is very minimal fall off in terms of KLD against the lab 16 bit. If you have the memory for BF16 KV-cache (which is usually easier to stomach) then the Q8 is very close. But even Q8 quant model with Q8 KV-cache is very close.

Smaller quants for the model start to fall off but more importantly, smaller KV-cache quants fall off much faster so avoid less than Q8 there.
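For anyone unfamiliar with the KLD metric mentioned above: it is the KL divergence between the next-token distribution of the 16-bit reference model and that of the quantized model, usually averaged over many token positions. A minimal sketch for a single position, using made-up logits over a four-token vocabulary (the values are hypothetical, not from any real model):

```python
import math

def softmax(logits):
    """Convert logits to probabilities, subtracting the max for stability."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def kl_divergence(p, q):
    """KL(p || q) in nats; p is the reference distribution."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

# Hypothetical logits for one token position
ref_logits = [2.0, 1.0, 0.1, -1.0]     # 16-bit reference model
quant_logits = [1.9, 1.1, 0.0, -0.8]   # quantized model

p = softmax(ref_logits)
q = softmax(quant_logits)
print(f"KL(ref || quant) = {kl_divergence(p, q):.6f} nats")
```

A value of zero means the quantized model predicts exactly the same distribution as the reference; larger values mean the quantization has shifted the predictions further.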

alfiedotwtf · 2026-05-30T17:47:28 1780163248

It’s not a general rule, and depends highly on the model and the quantisation used. Don’t guess, Unsloth sometimes publish graphs in their tutorials showing the error rate vs file size… sometimes Q4 is great, other times I go for Q6

thot_experiment · 2026-05-30T02:39:05 1780108745

q6 is fine for that qwen with ctx @ q8, and the dense models of that size are solid at q4 with q8 ctx

debazel · 2026-05-30T00:17:03 1780100223

I tried it with OpenCode and it is borderline incapable of using tool calls, so that might be why it is doing so bad on your test.

peder · 2026-05-30T00:48:43 1780102123

I just did the same. Absolutely awful. I assume OpenCode's heavy context is a problem, and it's probably better to use Liquid's own OpenCode alternative for this.

solarkraft · 2026-05-30T09:24:03 1780133043

Where can I find that agent harness? A look at their Docs and asking Gemini yielded no results.

Edit: Is it this? https://github.com/Liquid4All/cookbook/tree/main/examples/lo...

FYI: Opencode is very well tuned for Qwen models, but I haven’t found it that rare for niche models to perform badly in it.

h14h · 2026-05-30T15:07:43 1780153663

That's not all that surprising, IMO. From what I understand, LiquidAI is focusing pretty narrowly on building models that operate as the "agentic core" of a larger system.

If I were going to use this model, I'd be looking to use it more as the primary chat interface of a larger system, and having it orchestrate & delegate tasks to other places via tool calls. It's not quite as exciting on the surface as a local "do it all" model, but it does enable some pretty neat use-cases, IMO.

I'm imagining a local agent that is super low latency, works entirely offline, and capable of queuing up complex tasks for larger/smarter cloud agents which execute them asynchronously.

onlyrealcuzzo · 2026-05-30T18:13:11 1780164791

Interesting...

Two of the other responses speak about it being abysmal at tool calling.

Overall, I'm pretty impressed a model this small can find/fix ~12% of bugs with crappy context - even if they're about as easy as possible to fix.

I just assumed it would perform better, given all the advancements in the space.

It's possible 1B active parameters is just not enough - even if it has 8B params of knowledge to reason through bugs.

Playing around with the context I fed it, it was able to fix up to ~34% of bugs vs ~46% for Qwen2.5-Coder-3B and ~54% for Qwen2.5-Coder-7B.

XCSme · 2026-05-30T00:33:10 1780101190

I will test it when it's accessible via OpenRouter, but the previous LFM2 model (lfm-2-24b-a2b) didn't do well on my tests, it got only 1/20 questions/tasks right, way below Gemma 31B or Qwen 35b-a3b (those get like 10/20 right)

BoorishBears · 2026-05-30T13:57:57 1780149477

I tested it against Gemma 4 31B and it's expectedly not favorable for world knowledge.

But even against E4B it's shaky, which is surprising given how many tokens they trained on. I guess it was on a lot of synthetic data.

mike_hearn · 2026-05-30T15:40:14 1780155614

It's not intended to be a coding model, however.

HanClinto · 2026-05-29T22:56:02 1780095362

Some of the coding-specific fine-tunes were really impressive boosts. Qwen2.5-3B-Instruct is also available [0] -- if it's not too much to ask, I'd be curious how more general models stack up in your benchmark?

[0] - https://huggingface.co/Qwen/Qwen2.5-3B-Instruct

mlmonkey · 2026-05-29T19:50:58 1780084258

Question: I have a dirty car and the car wash is just 50 meters away. Should I walk or drive to the carwash?

Answer: . . . . So, unless you have a compelling reason not to, walk to the car wash.

nine_k · 2026-05-29T20:23:40 1780086220

You did not express any desire to wash the car. After nudging, it's funnier:

> Is my desire to make my car clean related to a car wash in any way? To driving to it?

> Your desire to have a clean car isn’t tied to the existence or use of a car wash or to traveling to one. It stems from how much you value the effort, time, and resources you’re willing to invest in achieving cleanliness — whether that means doing it yourself, paying for a professional service, or simply enjoying the routine of a quick walk‑out. Driving to a nearby wash is just one possible method among many; the decision ultimately depends on factors such as how much time you have, how much you’d like to spend, the quality of the wash, and any personal preferences you have for hands‑on versus outsourced cleaning. In this case, walking is the simplest and fastest way, but it isn’t required for you to achieve a clean car.

Common sense is clearly there, but we should not underestimate the colossal heap of tacit assumptions that drive "obvious" decisions in our daily life.

enaaem · 2026-05-30T00:13:21 1780100001

The AI would be an awful husband

andai · 2026-05-29T21:51:16 1780091476

Reminder that the human baseline for the car wash question is also pretty bad.

MPSimmons · 2026-05-29T23:58:01 1780099081

Oh, I'm interested - do you have any docs with human responses to that?

andai · 2026-05-30T02:24:42 1780107882

“Car Wash” test with 53 models

https://news.ycombinator.com/item?id=47128138

This article has a graph of the human response rates. About 70% correct on average. Accuracy depends on the country (maybe a language barrier?).

See also original thread on the car wash thing.

I want to wash my car. The car wash is 50 meters away. Should I walk or drive?

https://news.ycombinator.com/item?id=47031580

yieldcrv · 2026-05-30T05:35:04 1780119304

this reminds me, I grew up in an area of the US where the pinnacle of existence was spending the whole weekend doing chores such as very publicly washing your own car in your driveway

if you were an able bodied man there is no other duty. the same for shoveling snow, or mowing a lawn, cleaning up inside the house

these are all things I've rejected and exempt myself from

but I'm beginning to remember large swaths of society live under that regime, so driving to a car wash wouldn't be an option at all. you wash your car and have a separate desire to walk to the car wash for some other reason

I could see people thinking it's a trick question, or just scoffing at the idea people wash their cars at the car wash and pollute the data for AIs in annotation work.

rbanffy · 2026-05-30T09:23:59 1780133039

Sometimes I miss washing my car on the driveway. I guess I’m far less emotionally attached to my car now than I was in the 1980s.

roenxi · 2026-05-30T15:39:13 1780155553

"Correct" is pushing it, the question is too vague if approached as a genuine question and not a gotcha. I've actually had literal experiences where I wanted to wash my car and walked to a car wash in the past. That was me collecting the car, and there is an argument that would be a valid walk answer.

If we require logical rigour there isn't enough context in the question. If we allow for informal language then there are absolutely situations where cars get washed and people walk 50 meters to the car wash. It is a reasonable guess that the car is already at the wash and you have a 2nd car, given the question is being asked. It's a slight leap, but it is an inference that makes the question meaningful and so it is one that could be made.

I'd assume the LLMs are just failing at spatial reasoning, because AFAIK they're terrible at it. But both answers are justifiable because we don't know where the car is and have to make assumptions.

HappMacDonald · 2026-05-31T02:56:31 1780196191

Well presumably it's simply better for the environment and for your own health to just carry the car with you

cwnyth · 2026-05-29T20:05:00 1780085100

I'm surprised these models haven't picked this up yet in the training data. Both Claude and ChatGPT missed that one when I posed the question to them last year.

treis · 2026-05-29T22:21:33 1780093293

ChatGPT still says walk but adds:

>The main reasons to drive such a short distance would be if you're bringing the car specifically to be washed, carrying something heavy, or the weather or walking conditions make it impractical.

>If your goal is to get your car washed, you'll need the car there—so driving makes sense. If you're just going to talk to someone at the car wash or check it out, walking is probably faster.

tingletech · 2026-05-29T20:30:47 1780086647

Why would a model know that one washes cars at a car wash? We don't clean our bodies at the body wash or clean the kitchen at the kitchen wash.

nullpoint420 · 2026-05-30T07:56:24 1780127784

Ok, I'm supposed to assume that a model doesn’t know cars get washed at a car wash?

But then I'm supposed to give it access to write code in my repositories. Sorry, what are you trying to get at here?

shepardrtc · 2026-05-29T20:47:09 1780087629

There's meaning in the term "car wash" that it understands. But I don't suspect anyone has taught it that for 99.9% of people, going to a car wash ONLY means that you're going to wash your car and that it should make that implicit assumption.

What if you're the car wash owner? Or a maintenance technician? Pretty easy to just walk over there if you're just 50 meters away.

jjtheblunt · 2026-05-29T20:52:15 1780087935

to your point, when my Aussie friends first mentioned a "car park" to my north american born self, i wondered _momentarily_ what that was, then realized it's sort of a fun name for what i would call a parking lot.

nl · 2026-05-30T00:57:16 1780102636

I've never thought of it as a fun term before.

We use "park" as "I will park the car" not park as in "amusement park"

BobbyTables2 · 2026-05-30T04:04:00 1780113840

Why isn’t the “pantry” called the “food store”?

justinclift · 2026-05-30T13:05:20 1780146320

Does your pantry have a cashier and let you buy stuff there?

Because a food store sounds like it does.

jjtheblunt · 2026-05-30T01:30:19 1780104619

yeah but syntactically "car park" gets used like a noun phrase, not verb phrase, which was (to your point really) what had me think "huh?" momentarily.

SequoiaHope · 2026-05-29T21:32:10 1780090330

Every model knows what a car wash is.

purerandomness · 2026-05-29T23:28:48 1780097328

If it doesn't, what's the point of using it? Trusting it with your workflows, your code?

sroussey · 2026-05-29T21:06:27 1780088787

I walk to the gas station more often than I drive there.

deklesen · 2026-05-29T21:28:25 1780090105

Yeah, but you are not washing yourself there, I suppose?

The whole twist here is that to wash your car, you need your car, so you cannot go by foot.

strangegecko · 2026-05-30T00:05:03 1780099503

His analogy is that a gas station is for putting gas into your car. But he walks there often, so the assumption that you need your car if you go to the gas station isn't inevitable.

You could conceivably walk to a car wash that has similar sundries as a gas station.

sroussey · 2026-05-30T00:31:45 1780101105

Indeed, the little market there is why I walk there. There is also one at the car wash another 2 blocks away. I’d walk there for a 7up if it were closer!

dominotw · 2026-05-29T20:29:15 1780086555

doesn't seem unreasonable.

halJordan · 2026-05-29T20:59:03 1780088343

These faux questions always have a valid interpretation that the asker doesn't admit (for some reason). The model is then castigated for not making an opinionated choice

kennywinker · 2026-05-29T21:46:39 1780091199

That’s not what’s happening.

The question is revealing that the model has a model of language but not of reality. It knows what words go together, but not real-world concepts.

ahoka · 2026-05-30T10:03:03 1780135383

This. LLMs are marketed on the false premise of all knowledge, intelligence and wisdom being possible to be encoded in language only.

riversflow · 2026-05-30T19:23:53 1780169033

lol, i think the LLM shows more wisdom here than the average person. Functionally, being 50m away from the car wash is at the car wash if you have a dirty car in your possession that needs cleaning. Realistically, the only reason you express the need to go to the carwash if you are in a 50m proximity with your car you intend to clean at the carwash is if you need to walk in and talk to someone.

dd8601fn · 2026-05-30T03:22:18 1780111338

As a test, explaining away peculiar answers by imagining unlikely outlier scenarios is not the counter you seem to think it is.

For most of them, we’d worry that a human answerer using maximum effort to produce the same outcome was having a stroke.

m463 · 2026-05-30T04:14:17 1780114457

maybe unreasoning.

also, naysayers apparently DO have a compelling reason.

2001zhaozhao · 2026-05-30T06:18:12 1780121892

At some point we have to be running into some inherent mathematical limits of knowledge compression, right? No way the knowledge benchmarks on these 8B models will keep getting better without overfitting on these benchmarks

yorwba · 2026-05-30T06:51:02 1780123862

If you give the model access to specialized tools (e.g. web search for question answering) the knowledge doesn't have to be stored in the model weights, which leaves some room for improvement. You'd still be overfitting to benchmarks (since different tasks might require different tools) but not necessarily to specific benchmark questions, so within-domain generalization could be quite good.

As an example for a similar approach, Teapot AI has trained very small models https://teapotai.com/models to only answer questions where the answer can be found within the context window, and although not perfect, they do quite well at this compared to larger, more general models.

geek_at · 2026-05-30T12:58:01 1780145881

Good point. I have the feeling larger models (20b+) rely too much on their stored knowledge and sometimes fail to use tools because they think they know the answer. Smaller specialized tool-calling models could be the smart route for the future.

Woodi · 2026-05-31T12:48:40 1780231720

Yeah, it's strange, that whole movement of stealing every possible book and then lobbying for laws prohibiting... something.

Humans train "thinking methodology" first and then know how to use it while accessing data and to build knowledge.

Humans do not memorize at once all text in existence, that's totally stupid.

Already thinking humans specialize in disciplines: math, chemistry, IT, cooking, etc while still using new data.

All of that computing is local- on the LAN of the brain.

So if some "agents" want to help then there is zero need for computation outside of the home/corporation/car local area network.

Licenses ??

SubiculumCode · 2026-05-29T20:07:40 1780085260

Anybody use their localcowork [1] before? That is where the demo lives. Or not?

[1] https://github.com/Liquid4All/cookbook/tree/main/examples/lo...

adityashankar · 2026-05-29T19:15:43 1780082143

This is super interesting, I'm particularly excited for this one as it may allow teams to scale this architecture for VLAs (vision language action models), and having sparser models means more real-time actions on a locally hosted model

demo link for anyone that wants to try this out https://playground.liquid.ai/chat?model=cmppnbgse000004l4bc8...

HappMacDonald · 2026-05-31T15:17:47 1780240667

Neither of the VL models work for me in playground though, they just error out

Ifkaluva · 2026-05-29T21:40:11 1780090811

Liquid does amazing work, but I kinda feel like they are overtraining their models. 38T tokens seems like a lot for an 8B model

andai · 2026-05-29T21:50:08 1780091408

What's the downside? Don't they stop when they hit diminishing returns?

Ifkaluva · 2026-05-30T00:49:22 1780102162

You’d think so, but I haven’t seen it explicitly discussed in their papers, and nobody else that I know of trains on that many tokens

hgoel · 2026-05-30T16:39:00 1780159140

Wouldn't the model start overfitting at some point? Degrading generalization for accuracy on the training set.

chabes · 2026-05-29T19:52:32 1780084352

The small models are getting really impressive.

I recently realized that Qwen3.5:4B is way more capable than I thought a model that size could be.

Combine that with the work Liquid puts into RL and fine tuning, and you get models that perform extremely well on minimal hardware.

Combine that with your own fine tuning, and you get a specialized tool that is fast, private, and doesn’t require internet connection.

r0b05 · 2026-05-29T19:57:33 1780084653

What did you use qwen3.5 4b for?

steve_adams_86 · 2026-05-29T22:19:13 1780093153

I use it for triaging my messages and emails and reminding me how all of it ties together. It uses Obsidian to know where to put stuff and how to connect information. It isn't perfect. It's very slow (using a 32GB M2 Max) but fast enough for my needs.

A good example of how it's helpful is that it will make certain things relatively frictionless. Like, I need to pay property taxes. I hate this stuff. I got the email reminder from my municipality and it made an entry in my TODOs which points to page with instructions to pay the taxes, including my folio and access numbers for when I log in. That was taken from the email and a document which contains past property tax information. I have it all there, but it compiles relevant data into dedicated TODO pages.

I'm so bad at doing all of this myself. I really don't enjoy it. Send me to buy a carrot at the store and I'll happily walk 30 minutes there and back to do it. It isn't the effort so to speak; it's how unrewarding, inefficient, and bureaucratic it all is. I'm allergic to it. Why isn't it baked into my income taxes? Why are we still doing this?

Sometimes it does a really bad job of making TODOs. Like my wife messaged me about what our dinner plan was, so Qwen went ahead and made a plan for chicken meatball soup based on messages from a week earlier. It totally fabricated the recipe. Yet, I don't know, it was still helpful to be reminded that I'm in charge of dinner.

It's probably best at scaffolding responses to emails I don't want to send. I will write it, but I appreciate basic information being fleshed out so I can write it without jumping around looking for files or numbers or whatever constantly.

I use it with a custom harness. It could be a lot better. Everything about it could be better. The model is remarkably good for its size and price, though.

Letting Sonnet 4.6 do it instead always yields much better results, much faster, but it's kind of like using a new phone vs a super old one. They can both get you there. The sound quality and camera might be worse, it doesn't look as fancy, but the new one is $1200 and the old one is free on marketplace if you're handy with a screwdriver and a fresh battery. Sounds great to me

Worth noting: this was all vibe-coded using Opus 4.6 and 4.7. It's the only project I've built that is strictly vibe-coded. It's simultaneously exciting and disgusting. I'm not sure if I'll ever 'software engineer' it, or I'll just let it be slop. It works.

cjtrowbridge · 2026-05-29T21:22:05 1780089725

its really good at agentic tasks

sroussey · 2026-05-29T21:04:59 1780088699

I find it works well in the browser.

irthomasthomas · 2026-05-29T21:42:18 1780090938

Woah, chinchilla scaling is 20 x active_params. I think mistral was 2 x Chinchilla. This is 1800 x
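The arithmetic behind that comparison, as a minimal sketch. It assumes the usual rule of thumb of about 20 training tokens per parameter, and takes the round figures from the model name (1B active, 8B total) and the headline (38T tokens) at face value; the exact active parameter count is not stated in this thread, which may explain the small gap to the ~1800x quoted above.

```python
# How far past the Chinchilla rule of thumb is 38T tokens?
# Assumptions: ~20 tokens per parameter as the compute-optimal heuristic,
# and round parameter counts read off the "8B-A1B" name.

CHINCHILLA_TOKENS_PER_PARAM = 20

TRAIN_TOKENS = 38e12    # 38T tokens, from the headline
ACTIVE_PARAMS = 1e9     # "A1B": ~1B active parameters (rounded)
TOTAL_PARAMS = 8e9      # 8B total parameters (rounded)

def chinchilla_multiple(tokens: float, params: float) -> float:
    """Training tokens as a multiple of the ~20 tokens/param heuristic."""
    return tokens / (CHINCHILLA_TOKENS_PER_PARAM * params)

print(f"vs active params: {chinchilla_multiple(TRAIN_TOKENS, ACTIVE_PARAMS):.1f}x")
print(f"vs total params:  {chinchilla_multiple(TRAIN_TOKENS, TOTAL_PARAMS):.1f}x")
```

With these round inputs the multiple is 1900x against active parameters and 237.5x against total parameters, so which count you pick changes the picture by nearly an order of magnitude.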

frankdlc222 · 2026-05-29T23:40:02 1780098002

Look at the accuracy numbers and these things clearly don't know much yet, and I'm not about to hand one my hardest work. But you can see where it's going. As quantization and the MoE stuff keeps getting better, "good enough to just run on my own machine" keeps eating into more of what I'm currently paying a frontier lab for. Once a local model can handle like 80% of what I need, the math stops making sense for the subscription.

kilroy123 · 2026-05-29T21:57:03 1780091823

Hmm, I asked it who made it, and it says Google?

pure_magic · 2026-05-30T02:23:06 1780107786

Many such cases. Many models say they're ChatGPT, a lot seem to figure out that since they're Transformers they're made by Google. Doesn't really tell you a lot. Perhaps a pretraining / midtraining artifact.

ramshanker · 2026-05-29T19:46:45 1780084005

Guess we can run this even on CPU!

bee_rider · 2026-05-29T19:48:26 1780084106

They seem… much better than all the models they compared against? What’s the catch?

FuckButtons · 2026-05-29T20:18:49 1780085929

They only showed the benchmarks where they outperformed?

andai · 2026-05-29T21:51:48 1780091508

It's twice the size?

elorant · 2026-05-29T19:41:06 1780083666

Wow, this is fucking phenomenal. I fed it a long transcript asking it to create a summary and it executed it extremely well. For an 8B model this is quite impressive.

SubiculumCode · 2026-05-29T20:25:39 1780086339

I gave it 2000 lines of Python code that does some fairly sophisticated geodesic calculations on surfaces, and asked it to review the code. I then asked Claude and ChatGPT to "assess the accuracy of this review" and they did not hold back. That said, it's a very small model, and very fast.

ValdikSS · 2026-05-30T08:58:21 1780131501

Bad at translation, at least to Russian. Very fast though, about 2x faster than Gemma 4 e2b on my CPU.

feelingsonice · 2026-05-30T03:11:32 1780110692

Is Liquid AI still using the liquid neural network architecture?

jauntywundrkind · 2026-05-29T22:13:44 1780092824

I really love how fast it is! The comparisons in their press release on Strix Halo and M5 Max are impressive. It going twice as fast in the GPU benchmarks even more so!

grigio · 2026-05-30T07:58:09 1780127889

I tested the previous model from Liquid; unfortunately, big claims but poor real performance.

HenryMulligan · 2026-05-29T19:29:43 1780082983

Why does this not have (day-one) support for Ollama? The previous model is on there? Is it related to the ongoing refactor work or are people abandoning Ollama for other LLM engines?

TobTobXX · 2026-05-29T19:31:46 1780083106

Ollama is just llama.cpp but with their own interface on top. Liquid does support llama.cpp, but Ollama is slow in updating its llama.cpp dependency.

garo-pro · 2026-05-29T19:46:34 1780083994

It does, ollama pull maternion/lfm2.5

gmuslera · 2026-05-29T19:39:22 1780083562

Homeopathic AI

nickpsecurity · 2026-05-30T03:41:58 1780112518

I'd normally call that a low-effort, troll comment. But, thinking on it, you may have a great metaphor.

They keep promising great performance out of models whose key ingredient (parameters) they are diluting. Many seem to be in a competition saying they're getting smaller and higher performance at the same time. Then, the homeopathic models don't perform as well as real models when independently tested. Again, spot on.

zmmmmm · 2026-05-29T21:42:36 1780090956

No vision support?