# Inference API
The Inference API enables you to run AI models on-device. mimOE supports two types of AI inference:
## Generative AI
Generate text, hold conversations, and create embeddings using LLMs in GGUF format.
Use cases:
- Chat applications and conversational AI
- Text generation and completion
- Semantic search with embeddings
- Code assistance
Generative AI Guide: OpenAI-compatible API for chat completions and embeddings
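As a minimal sketch, assuming your mimOE deployment exposes the OpenAI-compatible endpoint at `http://localhost:8083/api/v1` and has a GGUF model loaded (both the base URL and the model name below are placeholders; substitute the values from your deployment), a chat completion request might look like this:

```typescript
// Sketch: chat completion against an OpenAI-compatible endpoint.
// BASE_URL and MODEL are assumptions -- replace with your deployment's values.
const BASE_URL = "http://localhost:8083/api/v1"; // hypothetical
const MODEL = "llama-3.2-1b-instruct-q4";        // any loaded GGUF model

async function chat(prompt: string): Promise<string> {
  const res = await fetch(`${BASE_URL}/chat/completions`, {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({
      model: MODEL,
      messages: [
        { role: "system", content: "You are a helpful assistant." },
        { role: "user", content: prompt },
      ],
    }),
  });
  if (!res.ok) throw new Error(`Inference request failed: ${res.status}`);
  const data = await res.json();
  // OpenAI-compatible responses return choices[].message.content
  return data.choices[0].message.content;
}

chat("Summarize what on-device inference means in one sentence.")
  .then(console.log)
  .catch(console.error);
```

See the Generative AI Guide for the full request and response schema.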
## Predictive AI
Run classification, regression, and other predictive models for real-time inference.
Use cases:
- Image classification
- Anomaly detection
- Sensor data analysis
- Real-time predictions
**Coming Soon:** Predictive AI inference documentation will be added in a future release.
## Choosing the Right Approach
| Need | Use |
|---|---|
| Chat, text generation, Q&A | Generative AI |
| Semantic search, similarity | Generative AI (embeddings) |
| Classification, regression | Predictive AI (coming soon) |
| Sensor/time-series analysis | Predictive AI (coming soon) |
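For the semantic search row above, a sketch of how embeddings can be used for similarity ranking follows. It assumes the same hypothetical base URL as the earlier example and an embedding model name that is a placeholder; the cosine-similarity helper is illustrative, not part of the mimOE API.

```typescript
// Sketch: semantic search via the embeddings endpoint -- embed a query and
// some documents, then rank documents by cosine similarity.
// BASE_URL and EMBED_MODEL are assumptions; use your deployment's values.
const BASE_URL = "http://localhost:8083/api/v1"; // hypothetical
const EMBED_MODEL = "nomic-embed-text-v1.5";     // any loaded GGUF embedding model

async function embed(texts: string[]): Promise<number[][]> {
  const res = await fetch(`${BASE_URL}/embeddings`, {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({ model: EMBED_MODEL, input: texts }),
  });
  if (!res.ok) throw new Error(`Embedding request failed: ${res.status}`);
  const data = await res.json();
  return data.data.map((d: { embedding: number[] }) => d.embedding);
}

// Cosine similarity between two vectors of equal length.
function cosine(a: number[], b: number[]): number {
  let dot = 0, na = 0, nb = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    na += a[i] * a[i];
    nb += b[i] * b[i];
  }
  return dot / (Math.sqrt(na) * Math.sqrt(nb));
}

async function search(query: string, docs: string[]) {
  const [q, ...vecs] = await embed([query, ...docs]);
  return docs
    .map((doc, i) => ({ doc, score: cosine(q, vecs[i]) }))
    .sort((a, b) => b.score - a.score);
}

search("battery health alerts", [
  "How to configure sensor thresholds",
  "Notifications for low battery conditions",
  "Deploying a GGUF model to the edge",
]).then((ranked) => console.log(ranked));
```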