⚡ A Transistor Corner Production // Host Node: ://webpagearea.com ⚡
The common consensus among hardware enthusiasts is that running massive large language models on twelve separate 8GB legacy GPUs over a decade-old motherboard is highly impractical, if not impossible. They are right on paper... but are they right? I decided to find out.
The Reality: By leveraging a custom parallel optimization layer, this rig successfully loaded and ran GPT-OSS 120B locally at 7.70 tokens/sec using ancient DDR3 and three cheap graphics cards.
⚡ [PHOTO] THE COMPLETED 12-CORE MATRIX IN THE RIG ⚡
Want to assemble this micro-cluster yourself? Track the components on eBay using the nodes below:
Disclaimer: As an eBay Partner, I earn from qualifying purchases made via the tracking nodes above at no additional cost to you. Supports the bench!
| Model Identity | Footprint | Token Gen Speed (Eval) |
|---|---|---|
| gpt-oss-120b:latest | 65 GB | 5.83 tokens/s 🚀 |
| qwen3.5-122b-a10b:latest | 81 GB | 2.71 tokens/s |
| gpt-oss-20b:latest | 13 GB | 8.14 tokens/s |
| deepseek-coder-v2-16b:latest | 8.9 GB | 12.38 tokens/s |
| llama3.3-70b:latest | 42 GB | 1.18 tokens/s |
YOU ARE VISITOR NUMBER:
Constructed with pure HTML/CSS. Best viewed in Netscape Navigator 4.0 or higher.
--------------------------------------------------
© 2026 Transistor Corner / Gary Davenport. All rights reserved.