LM Studio

by Element Labs

Desktop app to discover, download, and run open-weight LLMs locally, with a local API

FrameworkAssistant

Last reviewed 2026-06-20

LM Studio is a desktop application for running open-weight large language models locally and privately on your own hardware. It bundles a model browser for downloading models, an in-app chat UI, and a local inference engine, running GGUF models via llama.cpp on macOS, Windows, and Linux and Apple MLX models on Apple Silicon. It also exposes a local server with an OpenAI-compatible REST endpoint (default port 1234), a command-line tool (lms), Python and JavaScript SDKs, and a headless build (llmster) for servers and CI. LM Studio is local-inference infrastructure plus a chat client, not an autonomous agent itself. It serves models and passes through model-level features such as tool/function calling, structured (JSON-schema) output, and attaching documents to chat for offline RAG, and it acts as an MCP client so you can install Model Context Protocol servers and use them with local models. Developers point IDE assistants, agent frameworks, and LLM apps at the local LM Studio endpoint to get private, offline inference. It is built by Element Labs and is free for both personal and work use, with a separate Enterprise tier for centralized administration.

What it can do

Discover, download, and run open-weight LLMs locally
Assistant
In-app model browser to find and download open-weight models (gpt-oss, Llama, Qwen, Mistral, DeepSeek and others), then run them locally; supports GGUF models via llama.cpp on all platforms and Apple MLX models on Apple Silicon, per the official docs.
source
Local OpenAI-compatible server and SDKs
Assistant
Runs a local server exposing an OpenAI-compatible REST endpoint (default port 1234) plus an LM Studio REST API (beta), an lms CLI, Python and JavaScript SDKs, and a headless build (llmster) for servers and CI, so existing client code can target a local model.
source
Tool calling and structured (JSON-schema) output
Assistant
Passes through model-level tool/function calling via the chat completions endpoint and supports structured output constrained to a JSON schema through response_format and the SDKs; LM Studio serves these, it does not act on them itself.
source
Offline RAG and MCP client
Assistant
Lets you attach documents to a chat and interact with them entirely offline (RAG), and acts as an MCP client so you can install Model Context Protocol servers and use them with local models, per the official docs.
source

Strengths

+Polished cross-platform desktop GUI (macOS, Windows, Linux) with a built-in model browser and chat, easier for non-CLI users than raw runtimes
+Runs both GGUF (via llama.cpp) and Apple MLX models, and exposes an OpenAI-compatible local server plus Python/JS SDKs
+Free for personal and commercial use (since July 8, 2025), private and offline by default, with an MCP client built in

Limitations

−Infrastructure and chat client, not an agent: it serves models but does not plan, act, or orchestrate on its own
−The core desktop app is not open source (unlike some local-runtime peers), though its SDKs and CLI are on GitHub
−Performance and model quality are bounded by local hardware

Overview

LM Studio is a desktop application for discovering, downloading, and running open-weight large language models locally and privately. It bundles a model browser, an in-app chat UI, and a local inference engine. It runs GGUF models via llama.cpp on macOS, Windows, and Linux, and Apple MLX models on Apple Silicon. It also exposes a local server, an lms CLI, Python and JavaScript SDKs, and a headless build (llmster) for servers and CI. It is built by Element Labs (founded 2023, New York).

What it does

From the in-app browser you download models (gpt-oss, Llama, Qwen, Mistral, DeepSeek, and more) and chat with them offline. The local server exposes an OpenAI-compatible REST endpoint (default port 1234) plus an LM Studio REST API (beta), so existing client code can target a local model with a one-line base-URL change. LM Studio passes through model-level tool/function calling and structured (JSON-schema) output, lets you attach documents for offline RAG, and acts as an MCP client so you can install Model Context Protocol servers and use them with local models. It serves these capabilities; it does not act on them itself.

Integrations & setup

Install the macOS, Windows, or Linux build, then download a model from the in-app browser and load it. Use the chat UI, or start the local server and point an IDE assistant, agent framework, or LLM app at the OpenAI-compatible endpoint. The lms CLI, Python and JavaScript SDKs, and the headless llmster build cover scripting and server deployments. Models come from the Hugging Face ecosystem in GGUF and MLX formats.

Pricing

LM Studio is free for both personal and work use. Element Labs removed the prior separate commercial-license requirement on July 8, 2025, per its blog. A separate Enterprise plan (with features such as SSO and model/MCP gating) is available for organizations that need centralized administration, contact sales for pricing.

Best for / not for

Best for developers and teams who want a polished desktop app for private, offline inference over open-weight models, including non-CLI users who prefer a GUI, and as a local backend for agents and IDE assistants. Not a turnkey agent or hosted product, the core app is not open source, and performance is bounded by local hardware.

Alternatives

Ollama is the closest peer as a local model runtime with an OpenAI-compatible API, differing mainly in being open source and CLI-first versus LM Studio's GUI-first desktop experience. For building agents and RAG on top of a local model, framework-level tools such as LlamaIndex apply.

What people are saying

We aggregate real LinkedIn discussion into sentiment for the agents people search most. LM Studio isn't tracked yet, want it added? Request tracking.

FAQ

Is LM Studio free?+

Yes. LM Studio is free for both personal and work use; Element Labs removed the previous separate commercial-license requirement on July 8, 2025, per its blog. There is a separate Enterprise plan (with features such as SSO and model/MCP gating) for organizations that need centralized administration, contact sales for pricing.

Is LM Studio an AI agent?+

No. LM Studio is a desktop app for running open-weight LLMs locally plus a local OpenAI-compatible server. It passes through model features such as tool calling, structured output, and MCP, but it does not plan or act autonomously on its own. You build an agent on top by pointing a framework or assistant at the local LM Studio endpoint.

What models and formats does LM Studio run?+

Open-weight models such as gpt-oss, Llama, Qwen, Mistral, and DeepSeek. It runs GGUF models through llama.cpp on macOS, Windows, and Linux, and Apple MLX models on Apple Silicon Macs, per the official docs.

Sources

LM Studio (official site) · accessed 2026-06-20
LM Studio Docs (app) · accessed 2026-06-20
Tool Use (LM Studio developer docs) · accessed 2026-06-20
LM Studio is free for use at work (LM Studio blog) · accessed 2026-06-20

Last reviewed 2026-06-20

Alternatives & related

Ollama

Run open-weight LLMs locally with a CLI, REST API, and OpenAI-compatible endpoints

LlamaIndex

Open-source data framework for RAG pipelines and data-grounded agents

Continue

Open-source AI code assistant and agent for VS Code and JetBrains

Cline

Open-source autonomous coding agent for VS Code and JetBrains