Project Templates์ถ์ฒ: Show HN์กฐํ์ 8
Show HN: NSED is public โ Mixture-of-Models to Hit SOTA using self-hosted AI
By t_peersky2026๋
2์ 19์ผ
**Show HN: NSED is public โ Mixture-of-Models to Hit SOTA using self-hosted AI**
Hey HN, We're open-sourcing (source-available, BSL 1.1, patent pending) the orchestrator behind our paper benchmark results. NSED (N-Way Self-Evaluating Deliberation) is a Rust binary that coordinates multiple LLMs through structured rounds of proposals and cross-evaluation, using quadratic voting to prevent any single model from dominating the consensus.The result: Three open-weight models (20B, 8B, 12B) on consumer GPUs โ 64GB total VRAM, ~$7K hardware โ score 84% on AIME 2025. The same models individually or with naive majority voting score ~54%. That's frontier-model performance on hardware you can buy at Micro Center.How it works:Each agent independently proposes a solution Every agent evaluates every other agent's work Scores aggregate via quadratic voting (cost of influence grows quadratically โ no single model can dominate) Repeat. Agents see prior results, refine, re-evaluate System converges toward the highest-quality answer through adversarial cross-checkingIt's provider-agnostic โ mix Ollama, vLLM, OpenAI, Anthropic, or any OpenAI-compatible endpoint in the same deliberation...
---
**[devsupporter ํด์ค]**
์ด ๊ธฐ์ฌ๋ Show HN์์ ์ ๊ณตํ๋ ์ต์ ๊ฐ๋ฐ ๋ํฅ์ ๋๋ค. ๊ด๋ จ ๋๊ตฌ๋ ๊ธฐ์ ์ ๋ํด ๋ ์์๋ณด์๋ ค๋ฉด ์๋ณธ ๋งํฌ๋ฅผ ์ฐธ๊ณ ํ์ธ์.
Hey HN, We're open-sourcing (source-available, BSL 1.1, patent pending) the orchestrator behind our paper benchmark results. NSED (N-Way Self-Evaluating Deliberation) is a Rust binary that coordinates multiple LLMs through structured rounds of proposals and cross-evaluation, using quadratic voting to prevent any single model from dominating the consensus.The result: Three open-weight models (20B, 8B, 12B) on consumer GPUs โ 64GB total VRAM, ~$7K hardware โ score 84% on AIME 2025. The same models individually or with naive majority voting score ~54%. That's frontier-model performance on hardware you can buy at Micro Center.How it works:Each agent independently proposes a solution Every agent evaluates every other agent's work Scores aggregate via quadratic voting (cost of influence grows quadratically โ no single model can dominate) Repeat. Agents see prior results, refine, re-evaluate System converges toward the highest-quality answer through adversarial cross-checkingIt's provider-agnostic โ mix Ollama, vLLM, OpenAI, Anthropic, or any OpenAI-compatible endpoint in the same deliberation...
---
**[devsupporter ํด์ค]**
์ด ๊ธฐ์ฌ๋ Show HN์์ ์ ๊ณตํ๋ ์ต์ ๊ฐ๋ฐ ๋ํฅ์ ๋๋ค. ๊ด๋ จ ๋๊ตฌ๋ ๊ธฐ์ ์ ๋ํด ๋ ์์๋ณด์๋ ค๋ฉด ์๋ณธ ๋งํฌ๋ฅผ ์ฐธ๊ณ ํ์ธ์.
