Private LLM - Local AI Chat 12+

Name: Private LLM - Local AI Chat
Price: 4.99 GBP
Rating: 4.4 (23 reviews)
Author: Numen Technologies Limited

Local Offline Private AI Chat


Description

Meet Private LLM: Your Secure, Offline AI Assistant for macOS

Private LLM brings advanced AI capabilities directly to your iPhone, iPad, and Mac—all while keeping your data private and offline. With a one-time purchase and no subscriptions, you get a personal AI assistant that works entirely on your device.

Key Features:

- Local AI Functionality: Interact with a sophisticated AI chatbot without needing an internet connection. Your conversations stay on your device, ensuring complete privacy.

- Wide Range of AI Models: Choose from various open-source LLMs such as Llama 3.2, Llama 3.1, Google Gemma 2, Microsoft Phi-3, Mistral 7B, and StableLM 3B. Each model is optimized for iOS and macOS hardware using advanced OmniQuant quantization, which offers superior performance compared to traditional round-to-nearest (RTN) quantization methods.

- Siri and Shortcuts Integration: Create AI-driven workflows without writing code. Use Siri commands and Apple Shortcuts to enhance productivity in tasks like text parsing and generation.

- No Subscriptions or Logins: Enjoy full access with a single purchase. No need for subscriptions, accounts, or API keys. Plus, with Family Sharing, up to six family members can use the app.

- AI Language Services on macOS: Utilize AI-powered tools for grammar correction, summarization, and more across various macOS applications in multiple languages.

- Superior Performance with OmniQuant: Benefit from the advanced OmniQuant quantization process, which preserves the model's weight distribution for faster and more accurate responses, outperforming apps that use standard quantization techniques.

Supported Model Families:
- DeepSeek R1 Distill Based Models
- Phi-4 14B Model
- Llama 3.3 70B
- Llama 3.2 Based Models
- Llama 3.1 Based Models
- Google Gemma 2 Based Models
- Qwen 2.5 Based Models (0.5B to 32B)
- Qwen 2.5 Coder Based Models (0.5B to 32B)
- Solar 10.7B Based Models
- Yi 34B Based Models

For a full list of supported models, including detailed specifications, please visit privatellm.app/models.

Private LLM is a better alternative to generic llama.cpp and MLX wrapper apps such as Ollama, LLM Farm, LM Studio, and RecurseChat on three fronts:
1. Private LLM uses a faster mlc-llm-based inference engine.
2. All models in Private LLM are quantized using the state-of-the-art OmniQuant quantization algorithm, while competing apps use naive round-to-nearest quantization.
3. Private LLM is a fully native app built using C++, Metal, and Swift, while many of the competing apps are (bloated) Electron-based apps.
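
To make the comparison in point 2 concrete, here is a minimal Python sketch of plain round-to-nearest (RTN) quantization, the baseline the description contrasts with OmniQuant. It is illustrative only: the function names, the 4-bit default, and the sample weights are assumptions, and it does not reproduce OmniQuant or the app's actual pipeline.

```python
def rtn_quantize(weights, bits=4):
    """Asymmetric round-to-nearest quantization of one group of weights.

    Returns integer codes plus the scale and offset needed to restore them.
    """
    levels = (1 << bits) - 1            # e.g. 15 steps for 4-bit codes 0..15
    lo, hi = min(weights), max(weights)
    scale = (hi - lo) / levels or 1.0   # fall back to 1.0 for a constant group
    codes = [round((w - lo) / scale) for w in weights]
    return codes, scale, lo


def dequantize(codes, scale, lo):
    """Map integer codes back to approximate floating-point weights."""
    return [c * scale + lo for c in codes]


def max_abs_error(weights, bits=4):
    """Largest absolute difference between original and restored weights."""
    codes, scale, lo = rtn_quantize(weights, bits)
    restored = dequantize(codes, scale, lo)
    return max(abs(w - r) for w, r in zip(weights, restored))


if __name__ == "__main__":
    sample = [-1.0, -0.5, 0.0, 0.25, 1.0]   # hypothetical weight group
    codes, scale, lo = rtn_quantize(sample, bits=4)
    print("codes:", codes)
    print("max error:", max_abs_error(sample, bits=4))
```

With RTN, each restored weight lands within half a quantization step of the original, and nothing is tuned beyond the group's minimum and maximum. The description's claim is that OmniQuant improves on this baseline.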

Private LLM for macOS is optimized for Apple Silicon and performs best on Macs with the Apple M1 chip or later. Users on older Intel Macs without eGPUs may experience reduced performance. Please note that although the app nominally works on Intel Macs, we've stopped adding support for new models on Intel Macs due to performance issues associated with Intel hardware.

16 Feb 2025

Version 1.9.8

- Added support for 3-bit OmniQuant quantized versions of Llama 3.3 70B-based models (5 new models)
- Added support for 3-bit and 4-bit OmniQuant quantized versions of the EVA LLaMA 3.33 70B v0.1 model
- Added support for 8 new models from the Dolphin 3.0 family of models
- Added support for the unquantized version of the Llama 3.2 1B Instruct Abliterated model
- Added support for the 4-bit OmniQuant quantized Gemma 2 Ifable 9B creative writing model
- Context length is now displayed in the model quick switcher
- Fixed a crash with some newer models on older versions of macOS (Sonoma)
- Other minor bug fixes and updates

4.4 out of 5

23 Ratings

Useful

Hello, please add a feature to save conversations so I can pick them up later or start a new one without deleting the current history.

Finding this so useful

Finding so many uses running a local, private LLM. App works well, and I have not had any issues in the two months of use. Sure, I’m still going to use ChatGPT for certain tasks, but if I need it to be private, then this app excels. Even for non-private tasks, it’s very handy to have. But the best part for me is the Shortcuts integration. I didn’t really use Shortcuts until I purchased this app, and since then, I’ve found many uses as I learn how to use Shortcuts. Tip: use ChatGPT or similar to help you create a Shortcut.

I would like to see certain features implemented, which, according to the devs, are on the roadmap. Seeing how they regularly update the app and are forthcoming on Discord, it looks promising that we’ll see improvements in the coming months. But what it does now, it is spot on.

Amazing!

Can’t believe my iPad is so powerful!! Works a charm on my M1. I downloaded Phi3 no problem. You can also get it to talk by clicking on the text, then speech. I then downloaded another model, which wasn’t shown in the list of installed models. I had to quit the app and go back into it to see the new models, then voila! [It may seem obvious but it’s worth mentioning, as some users may not quit the app and may be quick to leave negative feedback.] Can I make a request? Can you add the best Aya23 model for translations?

Developer Response: Thanks for the review! Also thanks for reporting the downloaded model list synchronization issue. We've fixed it and it'll go out with the next update. We'd have loved to add the aya-23-8B model, but sadly it's licensed under a CC BY-NC license, making it legally untenable for us to add it. We'll be adding the newer Qwen2 models soon, which are liberally licensed and were trained on 29 languages (Aya models were trained on 23 languages). We expect those models to do well on translation tasks.

NOW AVAILABLE

MAJOR UPDATE

Dolphin 3.0 Models

Unlock the full potential of on-device AI with Dolphin 3.0 models, offering limitless private on-device conversations.

The developer, Numen Technologies Limited, indicated that the app’s privacy practices may include handling of data as described below. For more information, see the developer’s privacy policy.

Data Not Collected

The developer does not collect any data from this app.

Privacy practices may vary based on, for example, the features you use or your age.

Information

Provider: Numen Technologies Limited
Size: 1.3 GB

Supports: Family Sharing. Up to six family members can use this app with Family Sharing enabled.
