Hardware, built & shipped
From a silent Mac mini to multi‑GPU RTX 5090 workstations and RTX PRO 6000 Blackwell nodes — specced, assembled and burned‑in for inference.
Pico Software designs, builds and configures on‑premise hardware — and installs the full local‑LLM software stack — so companies and individuals can run today's best open‑weight models in‑house. No cloud. No prompts or documents leaving your network.
We supply the box and the brains: the right on‑prem hardware, the local‑LLM software stack, and the help to get it running for your team.
From a silent Mac mini to multi‑GPU RTX 5090 workstations and RTX PRO 6000 Blackwell nodes — specced, assembled and burned‑in for inference.
We install and tune the inference stack — open‑weight models, runtime, OpenAI‑compatible API and a chat UI — so your team can chat, code and build on day one.
On‑site or remote install, model selection, quantisation and ongoing support — sized to your workloads, context lengths and budget.
Everything runs inside your building. Prompts, documents and model weights stay on your network — ideal for regulated and privacy‑first teams.
Memory capacity decides which models fit; bandwidth decides how fast they generate. Each tier lists the largest open‑weight models that fit in memory at 4‑bit.
Indicative UK pricing (mid‑2026) and example open‑weight models — the largest that fit in memory at 4‑bit quantisation, where all parameters must fit in RAM/VRAM (Mixture‑of‑Experts models keep every parameter resident even though only a subset is active per token). Prices were moving month‑to‑month under a DRAM shortage; figures are from our June 2026 on‑prem LLM hardware report — please re‑check before ordering.
Run inference entirely on‑prem — no prompts, files or embeddings sent to third‑party APIs.
Memory capacity decides which models fit; we match the hardware to your target models and context lengths.
We focus on Apache 2.0 / MIT models you can deploy and build on commercially — and we flag the licensing traps.
Assembly, burn‑in, on‑site install and ongoing support — whether it's one desktop or a department server.
Tell us your use case and target models — we'll spec the right on‑prem build and quote it.
sales@pico-software.com