Hardware, built & shipped
From a silent Mac mini to multi‑GPU RTX 5090 workstations and RTX PRO 6000 Blackwell nodes — specced, assembled and burned‑in for inference.
Pico Software designs, builds and configures on‑premise hardware — and installs the full local‑LLM software stack — so companies and individuals can run today's best open‑weight models in‑house. No cloud. No prompts or documents leaving your network.
We supply the box and the brains: the right on‑prem hardware, the local‑LLM software stack, and the help to get it running for your team.
From a silent Mac mini to multi‑GPU RTX 5090 workstations and RTX PRO 6000 Blackwell nodes — specced, assembled and burned‑in for inference.
We install and tune the inference stack — open‑weight models, runtime, OpenAI‑compatible API and a chat UI — so your team can chat, code and build on day one.
On‑site or remote install, model selection, quantisation and ongoing support — sized to your workloads, context lengths and budget.
Everything runs inside your building. Prompts, documents and model weights stay on your network — ideal for regulated and privacy‑first teams.
Memory capacity decides which models fit; bandwidth decides how fast they generate. Each tier lists the largest open‑weight models that fit in memory at 4‑bit.
Indicative UK pricing (mid‑2026) and example open‑weight models — the largest that fit in memory at 4‑bit quantisation, where all parameters must fit in RAM/VRAM (Mixture‑of‑Experts models keep every parameter resident even though only a subset is active per token). Prices were moving month‑to‑month under a DRAM shortage; figures are from our June 2026 on‑prem LLM hardware report — please re‑check before ordering.
Run inference entirely on‑prem — no prompts, files or embeddings sent to third‑party APIs.
Memory capacity decides which models fit; we match the hardware to your target models and context lengths.
We focus on Apache 2.0 / MIT models you can deploy and build on commercially — and we flag the licensing traps.
Assembly, burn‑in, on‑site install and ongoing support — whether it's one desktop or a department server.
We're a small, focused UK team that designs on‑premise AI hardware and installs the full software stack — so you can run open‑weight LLMs locally without depending on cloud providers.
We spec the components, build and burn‑in every machine, install the inference stack and hand it over running. One team, no middlemen.
Pico Software is a trading name of Pina Colada Software Limited (Company 9605428), registered in England. All hardware is assembled and supported from the UK.
We started this business because we believe companies and individuals deserve to use powerful AI without handing their data to third parties.
Tell us your use case and target models — we'll spec the right on‑prem build and quote it.
sales@pico-software.com