The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
Updated Feb 11, 2026 · Python
Extension for Scikit-learn is a seamless way to speed up your Scikit-learn application (see the patching sketch after this list)
oneAPI Data Analytics Library (oneDAL)
The easiest way to use Machine Learning. Mix and match underlying ML libraries and data set sources. Generate new datasets or modify existing ones with ease.
Cross-platform C++ SDK and model hub for AI inference. Ready-to-deploy models include Segment Anything 3, Depth Anything 2, and Gemma.
High-Performance AI-Native Web Server — built in C & Assembly for ultra-fast AI inference and streaming.
Client library to interact with various APIs used within Philips in a simple and uniform way
GPU-aware inference mesh for large-scale AI serving
Unity TTS plugin: Piper neural synthesis + OpenJTalk Japanese + Unity AI Inference Engine. Windows/Mac/Linux/Android/iOS ready. High-quality voices for games & apps.
MOTO - Autonomous ASI Deep Research Harness by Intrafere: a creative, novelty-seeking mathematics researcher for STEM users. Press start once and it runs for days at a time with no interaction needed. MOTO runs multiple agents in parallel against a local LM Studio host, OpenRouter, or both. Star and follow the repo; there's more to come soon.
A development framework for Fully Homomorphic Encryption (FHE)
Customized version of Google's tflite-micro
A powerful, fast, scalable full-stack boilerplate for AI inference using Node.js, Python, Redis, and Docker
KaiROS AI: Intelligence, Precisely When It Matters.
Apache 2.0-licensed open source operations stack for private AI inference with open models. Run LLMs (7B-70B) locally with vLLM, OpenAI-compatible API, web dashboard, chat UI, admin panel, and hardware monitoring (see the client sketch after this list).
Open-source developer tool for testing deAPI.ai endpoints — unified AI inference API for image, video, audio, transcription, OCR and more
A personal demo project for Flutter + ONNX Runtime integration. Not related to any company work. A comprehensive on-device face recognition SDK for Flutter
Dockerized Yawcam-AI, Edge-ready AI NVR with CPU and CUDA builds, RTSP support, persistent storage, YOLO inference, and EdgePulse optimization.
Arbitrary Numbers
Distributed Inference Key-Value Cache in a Cloud Setting
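The Extension for Scikit-learn entry above accelerates existing scikit-learn code by patching it at import time. A minimal sketch of that pattern, assuming scikit-learn-intelex is installed; the estimator and the synthetic dataset are illustrative choices, not part of the listed project:

```python
# Minimal sketch: accelerating scikit-learn with Extension for Scikit-learn
# (scikit-learn-intelex). Assumes `pip install scikit-learn-intelex`.
from sklearnex import patch_sklearn

# The patch must run before scikit-learn estimators are imported,
# so that subsequent imports resolve to the accelerated implementations.
patch_sklearn()

from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

# Illustrative synthetic data; an existing scikit-learn workflow would run unchanged.
X, y = make_classification(n_samples=10_000, n_features=20, random_state=0)

clf = RandomForestClassifier(n_estimators=100, random_state=0)
clf.fit(X, y)
print(clf.score(X, y))
```

Only the import order matters: call patch_sklearn() before importing the estimators you want accelerated.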
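Several entries above, including the vLLM-based private-inference stack, expose an OpenAI-compatible HTTP API. A minimal client sketch follows; the base URL, API key, and model name are assumptions for illustration and should be replaced with whatever your server was actually started with:

```python
# Minimal sketch of calling an OpenAI-compatible chat endpoint, such as the one
# served by a local vLLM instance. BASE_URL and MODEL are assumed values.
import requests

BASE_URL = "http://localhost:8000/v1"            # assumed local server address
MODEL = "meta-llama/Llama-3.1-8B-Instruct"       # assumed model identifier

response = requests.post(
    f"{BASE_URL}/chat/completions",
    headers={"Authorization": "Bearer not-needed-for-local"},  # many local servers ignore the key
    json={
        "model": MODEL,
        "messages": [{"role": "user", "content": "Summarize what AI inference serving means."}],
        "max_tokens": 128,
    },
    timeout=60,
)
response.raise_for_status()
print(response.json()["choices"][0]["message"]["content"])
```

Because the endpoint mirrors the OpenAI REST schema, an OpenAI-style client library can generally be pointed at the same base URL instead of hand-rolling the request.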