hf-hub

reqwest-middleware tokenizers minijinja

Open source Granola AI Alternative

⑂ 632◎ 10

kreuzberg★ 8.5kactive

tokenizers napi minijinja

A polyglot document intelligence framework with a Rust core. Extract text, metadata, images, and structured information from PDFs, Office documents, images, and 97+ formats. Available for Rust, Python, Ruby, Java, Go, PHP, Elixir, C#, R, C, TypeScript (Node/Bun/Wasm/Deno)- or use via CLI, REST API, or MCP server.

⑂ 497◎ 8

mistral.rs★ 7.3kactive

ai cli

Fast, flexible LLM inference

tokenizers minijinja metrics-exporter-prometheus

⑂ 629◎ 356

dynamo★ 7.2kactive

ai grpc

A Datacenter Scale Distributed Inference Serving Framework

tokenizers crossbeam-queue aho-corasick

⑂ 1.2k◎ 766

lance★ 6.6kactive

tokenizers crossbeam-queue libm

Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming..

⑂ 696◎ 1.1k

text-embeddings-inference★ 4.9kactive

ai grpc

A blazing fast inference solution for text embeddings models

tokenizers metrics-exporter-prometheus ndarray

⑂ 402◎ 185

koharu★ 4.7kactive

tokenizers reqwest-middleware keyring

ML-powered manga translator, written in Rust.

⑂ 274◎ 107

RuVector★ 4.2kactive

RuVector is a High Performance, Real-Time, Self-Learning Ai, Vector GNN, Memory DB built in Rust.

tokenizers napi rkyv

⑂ 555◎ 147

spiceai★ 3.0kactive

tokenizers arrow-array wat

A portable accelerated SQL query, search, and LLM-inference engine, written in Rust, for data-grounded AI apps and agents.

⑂ 199◎ 379

sail★ 2.9kactive

grpc axum

Drop-in Apache Spark replacement written in Rust, unifying batch processing, stream processing, and compute-intensive AI workloads.

aws-credential-types num secrecy

⑂ 166◎ 219

luminal★ 2.9kactive

Inference at the speed of light.

tokenizers dyn-clone half

⑂ 210◎ 33

ck★ 1.6kactive

ai cli

Local first semantic and hybrid BM25 grep / search tool for use by AI and humans!

tokenizers syntect tree-sitter

⑂ 70◎ 27

mesh-llm★ 1.2kactive

chacha20poly1305 keyring napi

Distributed AI/LLM for the people. Share compute privately or publicly to power your agents and chat.

⑂ 144◎ 47

stoolap★ 1.2kactive

tokenizers lz4_flex web-time

A Modern Embedded SQL Database written in Rust

⑂ 42◎ 8

larql★ 1.0kactive

tokenizers minijinja wat

⑂ 179◎ 10

candle-vllm★ 678active

tokenizers minijinja flume

Efficent platform for inference and serving local LLMs including an OpenAI compatible API server.

⑂ 80◎ 28

Locus★ 537active

tokenizers keyring tree-sitter

The open source Unity Dev Agent

⑂ 62◎ 23

next-plaid★ 467active

tokenizers syntect tree-sitter

NextPlaid, ColGREP: Multi-vector search, from database to coding agents.

⑂ 56◎ 17

models★ 370maintenance

Models and examples built with Burn

tokenizers libm csv

⑂ 62◎ 19

smg★ 349active

tokenizers aho-corasick minijinja

Engine-agnostic LLM gateway in Rust. Full OpenAI & Anthropic API compatibility across vLLM, TRT-LLM, TokenSpeed, SGLang, OpenAI, Gemini & more. Industry-first gRPC pipeline, KV cache-aware routing, chat history, tokenization caching, Responses API, embeddings, WASM plugins, MCP, and multi-tenant auth.

⑂ 103◎ 78

karukan★ 306active

tokenizers unicode-normalization directories

Japanese Input Method System for Linux, macOS, Neural Kana-Kanji Conversion Engine

⑂ 19◎ 6

devo★ 298active

db cli

Open coding agent, provider agnositc.

syntect keyring tree-sitter

⑂ 129◎ 1

mofa★ 290active

tokenizers ndarray fs_extra

MoFA - Modular Framework for Agents. Modular, Compositional and Programmable.

rocksdb flume aws-sdk-s3

⑂ 174◎ 729

xybrid★ 244active

ai tokio

Build apps powered by on-device AI

⑂ 25◎ 27

pie★ 172active

tokenizers aho-corasick wasmtime-wasi

Pie: Programmable LLM Serving

⑂ 20◎ 14

active-call★ 170active

minijinja ndarray unicode-normalization

A SIP/WebRTC voice agent

⑂ 28◎ 4

aft★ 138active

tokenizers tree-sitter ndarray

Give your agent a proper IDE and OS. The sensorimotor cortex for coding agents (OpenCode + Pi), part of CortexKit: symbol-aware edits, semantic search, code health, fast grep/glob, bash compression, background tasks, PTY.

⑂ 16◎ 20

lethe★ 118active

rpassword pulldown-cmark dotenvy

Autonomous executive assistant with persistent memory and a multi-agent architecture

⑂ 14◎ 3

surrealml★ 118dormant

tokenizers ndarray wasmtime-wasi

A machine learning library for Python and Rust, for PyTorch, Tensorflow and SKLearn models

⑂ 19◎ 20

aprender★ 102active

tokenizers pollster syntect

Next Generation Machine Learning, Statistics and Deep Learning in PURE Rust

⑂ 20◎ 19

mentedb★ 96active

ai grpc

A cognition aware database engine for AI agent memory. Purpose built in Rust with WAL, HNSW, knowledge graphs, and speculative context pre assembly. Not a wrapper, a ground up storage engine that thinks.

tokenizers crossbeam-utils lz4_flex

⑂ 4◎ 13

mlxcel★ 83active

tokenizers minijinja libm

MLX-based experimental inference engine

⑂ 16◎ 17

llamastash★ 53active

ai cli

A fast terminal native app (TUI) and CLI with init wizard for launching local LLMs via llama.cpp with zero overhead

arboard windows directories

⑂ 3◎ 4

alith★ 44active

chain ai

Simple, Composable, High-Performance, Safe and Web3 Friendly AI Agents and LazAI Gateway for Everyone

tokenizers napi minijinja

⑂ 31◎ 34

origin★ 42active

Local-first AI work memory that compounds: capture decisions, lessons, gotchas in flow, distill into source-backed wiki pages, recall across sessions and any MCP client (Claude Code, Codex, etc...). Plain Markdown you own.

fs2 encoding_rs subtle

⑂ 3◎ 8

zeph★ 41active

tokenizers chacha20poly1305 tree-sitter

A memory-first AI agent that remembers why decisions were made — not just the last message. Runs local (Ollama), cloud (Claude · OpenAI · Gemini), or decentralized TEE. Graph memory, self-learning skills, multi-model routing, sandboxed tools. MCP · ACP · A2A. One Rust binary.

⑂ 4◎ 121

laminardb★ 37active

tokenizers humantime-serde arrow-array

Open-source streaming SQL engine written in Rust using Apache Arrow and DataFusion. Supports continuous queries, temporal stream joins, tumbling/session windows, and CDC/Kafka connectors. Lightweight, embeddable, and sub-microsecond latency

⑂ 4◎ 25

uni-db★ 37active

tokenizers crossbeam-queue arrow-array

Uni is a modern, embedded database that combines property graph (OpenCypher), vector search, and columnar storage (Lance) into a single, cohesive engine. It is designed for applications requiring local, fast, and multimodal data access, backed by object storage (S3/GCS) durability.

⑂ 4◎ 7

caro★ 34active

tokenizers arrow-array arrow-schema

caro: fast Rust CLI that turns natural‑language tasks into a safe POSIX command. Built for macOS (MLX/Metal) with a built‑in model; supports vLLM/Ollama/LM Studio. JSON‑only output, safety checks, confirmation, multi‑step goals, devcontainer included.

⑂ 5◎ 311

route96★ 27active

A Blossom/NIP96 server

axum-extra config sqlx

⑂ 7◎ 6

mcpmate★ 27active

tokenizers keyring handlebars

MCPMate is a progressive MCP management center for organizing servers, clients, profiles, capabilities, and runtime visibility in one local workspace.

⑂ 6◎ 1

LRC★ 27active

为 AI 记忆提供了一种从“容器”到“场”的范式迁移，实现永久记忆。

tokenizers ureq sqlx

⑂ 5

mold★ 24active