LM Studio Quick Start - 5 Minutes to Local AI! 🚀
What You Get
- ✅ Run AI locally on your machine
- ✅ Complete privacy - no data sent to cloud
- ✅ No API costs - completely free
- ✅ Works offline after setup
- ✅ Same features as Azure OpenAI
Step 1: Download LM Studio (2 min)
- Go to https://lmstudio.ai/
- Click Download for your OS (Windows/Mac/Linux)
- Install and launch LM Studio
Step 2: Get a Model (3 min)
Recommended Starting Model: Hermes-2-Pro-Mistral-7B
In LM Studio:
- Click the 🔍 Search icon (top left)
- Search for:
hermes-2-pro - Find:
NousResearch/Hermes-2-Pro-Mistral-7B-GGUF - Click Download next to
Q4_K_Mversion (~4GB) - Wait for download to complete
Alternative Quick Models:
- Fast & Lightweight:
Phi-2-GGUF(~1.6GB) - Great for testing - Best Quality:
Mixtral-8x7B-Instruct-GGUF(~26GB) - Needs 32GB+ RAM
Step 3: Start LM Studio Server (30 sec)
- In LM Studio, click 💻 Local Server tab (left sidebar)
- Select your downloaded model from the dropdown
- Click Start Server button
- Wait for “Server started on http://localhost:1234”
- Keep LM Studio running!
Step 4: Configure FlowVision (1 min)
Option A: Using the UI (Easiest)
- Open FlowVision
- Go to Settings → LM Studio Configuration
- Check ☑ “Enable LM Studio”
- Click “Test Connection” (should show ✓ success)
- Click “Save”
Option B: Manual Configuration
Create file: %APPDATA%\FlowVision\lmstudioconfig.json
{
"EndpointURL": "http://localhost:1234/v1",
"ModelName": "local-model",
"APIKey": "lm-studio",
"Enabled": true,
"Temperature": 0.7,
"MaxTokens": 2048,
"TimeoutSeconds": 300
}
Step 5: Test It! (30 sec)
- In FlowVision, type: “What is 2+2?”
- You should see: “Local AI response” indicator
- Get a response from your local model!
✅ Success Indicators
You’ll know it’s working when you see:
- ✅ “LM Studio Action Execution” in the task indicator
- ✅ “Processing your request with local AI” message
- ✅ “Local AI response” during generation
- ✅ Responses coming without internet connection
🎯 Try These Commands
Simple Test
"Tell me a joke"
Tool Calling Test
"Open Notepad"
"What windows are open?"
"Create a file called test.txt with the content 'Hello World'"
Code Generation
"Write a Python function to calculate fibonacci numbers"
🐛 Troubleshooting
“Cannot connect to LM Studio”
Fix:
- Make sure LM Studio is running
- Check that server is started (green “Stop Server” button visible)
- Verify endpoint:
http://localhost:1234/v1
Model is too slow
Fix:
- Try a smaller model (Phi-2)
- Enable GPU acceleration in LM Studio settings
- Close other applications
- Reduce Max Tokens to 1024
“Connection timeout”
Fix:
- First request is slower (model loading)
- Increase timeout in config (300 → 600 seconds)
- Wait a bit longer
- Check CPU/RAM usage
Test Connection shows error
Fix:
- Restart LM Studio server
- Reload model in LM Studio
- Check firewall isn’t blocking port 1234
- Try changing port in LM Studio server settings
📊 Performance Tips
For Faster Responses:
- Temperature: 0.2 (more focused)
- Max Tokens: 1024 (shorter responses)
- Model: Use Phi-2 or TinyLlama
For Better Quality:
- Temperature: 0.7-0.8 (more creative)
- Max Tokens: 2048-4096 (longer responses)
- Model: Use Mixtral or Llama-2-13B
For Tool Calling:
- Best Model: Hermes-2-Pro-Mistral-7B
- Temperature: 0.3-0.5
- Max Tokens: 1024
🔄 Switching Between Local and Azure
Enable Local AI:
Check ☑ “Enable LM Studio” in settings
Disable (Use Azure):
Uncheck ☐ “Enable LM Studio” in settings
No restart needed! FlowVision automatically switches.
💡 Pro Tips
- Keep LM Studio running - Don’t close it while using FlowVision
- First request is slow - Model loading takes time, be patient
- Monitor resources - Check Task Manager for CPU/RAM usage
- Try different models - Each has different strengths
- Use GPU - Enable in LM Studio settings for huge speed boost
📱 Quick Reference
| Setting | Recommended | Fast | Quality |
|---|---|---|---|
| Model | Hermes-2-Pro-7B | Phi-2 | Mixtral-8x7B |
| Temperature | 0.5 | 0.2 | 0.7 |
| Max Tokens | 2048 | 1024 | 4096 |
| RAM Needed | 16GB | 8GB | 32GB+ |
🎉 That’s It!
You’re now running AI locally!
Next Steps:
- Try different models
- Experiment with settings
- Test tool calling features
- Read full documentation: LM Studio Integration Guide
🆘 Need Help?
- Check full guide: LM Studio Integration Guide
- LM Studio docs: https://lmstudio.ai/docs
- LM Studio Discord: https://discord.gg/lmstudio
- GitHub Issues: (your repo)
Enjoy your private, local AI! 🎊
No more API costs! No more cloud dependency! Complete control! 🚀