> Sorry to shatter your bubble, but this is patently false, LLMs are far more ef... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

woadwarrior01 67 days ago | parent | context | favorite | on: Ollama is now powered by MLX on Apple Silicon in p...

> Sorry to shatter your bubble, but this is patently false, LLMs are far more efficient on hardware that simultaneously serves many requests at once.

You might want to read this: https://arxiv.org/abs/2502.05317v2

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact