-
1.
zai-org/GLM-5 🤗 946
<div align="center"> <img src=https://raw.
-
2.
HKUDS/FastCode ⭐ 646
FastCode: Accelerating and Streamlining Your Code Understanding
-
3.
aeromomo/claw-compactor ⭐ 543
🦞 Claw Compactor — The 98% Crusher. Cut your AI agent token spend in half with 5 layered compression techniques.
-
4.
abdouhlili on /r/LocalLLaMA
-
5.
zilliztech/memsearch ⭐ 431
A Markdown-first memory system, a standalone library for any AI agent. Inspired by OpenClaw.
-
6.
Skill Compose is an open-source agent builder and runtime platform for skill-powered agents. No workflow graphs. No CLI.
- 7.
-
8.
Few_Painter_5588 on /r/LocalLLaMA
-
9.
policyweb on /r/LocalLLaMA
-
10.
OpenClaw skill for cost-optimized model routing based on task complexity
-
11.
Nanbeige/Nanbeige4.1-3B 🤗 262
Nanbeige4.
-
12.
#SaveLocalLLaMA 👽 864
ForsookComparison on /r/LocalLLaMA
-
13.
openhome-dev/abilities ⭐ 256
Open-source abilities for OpenHome agents.
-
14.
A RAG pipeline implementation built on the 'Epstein Files 20K' dataset from Hugging Face (Teyler).
-
15.
Infatoshi/x-cli ⭐ 242
CLI for X/Twitter API v2 -- post, search, like, bookmark from your terminal
-
16.
ResearchCrafty1804 on /r/LocalLLaMA
-
17.
MiniMaxAI/MiniMax-M2.5 🤗 219
In programming evaluations, MiniMax-M2.
-
18.
abdouhlili on /r/LocalLLaMA
- 19.
-
20.
FireRedTeam/FireRedASR2S ⭐ 210
FireRedASR2S is a SOTA Industrial-Grade All-in-One ASR system with ASR, VAD, LID, and Punc modules. FireRedASR2 supports Chinese (Mandarin, 20+ dialects/accents), English, code-switching, and singing lyrics recognition. FireRedVAD supports speech/singing/music in 100+ langs. FireRedLID supports 100+ langs and 20+ zh dialects.
-
21.
abdouhlili on /r/LocalLLaMA
-
22.
Mathews-Tom/no-magic ⭐ 193
Because `model.fit()` isn't an explanation
-
23.
sharbelxyz/x-bookmarks ⭐ 192
OpenClaw skill: turn your X bookmarks into agent actions. Stop hoarding. Start applying.
-
24.
Today, we are handing **MiniMax-M2.
-
25.
GLM 5 Released 👽 616
External_Mood4719 on /r/LocalLLaMA
-
26.
Is just a meme... 👽 589
HumanDrone8721 on /r/LocalLLaMA
- 27.
-
28.
juanceresa/sift-kg ⭐ 176
Turn any collection of documents into a knowledge graph. Extract entities and relationships via LLM, deduplicate with your approval, and explore the result in your browser — all from the CLI.
-
29.
inclusionAI/Ring-2.5-1T 🤗 169
Introducing Ring-2.
-
30.
OpenMOSS-Team/MOVA-360p 🤗 162
We introduce **MOVA** (**MO**SS **V**ideo and **A**udio), a foundation model designed to break the "silent era" of open-source video generation.
-
31.
tomascupr/sandstorm ⭐ 157
One API call. Full Claude agent. Completely sandboxed.
-
32.
Which_Slice1600 on /r/LocalLLaMA
-
33.
RIPT1D3_Z on /r/LocalLLaMA
-
34.
joshavant/clawbox ⭐ 150
OpenClaw-ready macOS VMs
-
35.
KaniTTS2 — open-source 400M TTS model with voice cloning, runs in 3GB VRAM. Pretrain code included. 👽 491
ylankgz on /r/LocalLLaMA
-
36.
unsloth/GLM-5-GGUF 🤗 146
<div align="center"> <img src=https://raw.
-
37.
kyutai-labs/hibiki-zero ⭐ 144
A real-time and multilingual speech translation model
-
38.
softmatcha/softmatcha2 ⭐ 143
A fast and soft pattern search for trillion-scale corpora.
-
39.
ysharma3501/LavaSR ⭐ 137
🌋LavaSR: Fast Speech restoration and enhancement
-
40.
AgriciDaniel/claude-ads ⭐ 136
Comprehensive paid advertising audit & optimization skill for Claude Code. 186 checks across Google, Meta, YouTube, LinkedIn, TikTok & Microsoft Ads with weighted scoring, parallel agents, and industry templates.
-
41.
[D] Ph.D. from a top Europe university, 10 papers at NeurIPS/ICML, ECML— 0 Interviews Big tech 👽 449
Hope999991 on /r/MachineLearning
-
42.
danielhanchen on /r/LocalLLaMA
-
43.
symbolica-ai/arcgentica ⭐ 124
An ARC-AGI solution using Agentica from Symbolica
-
44.
OpenMOSS-Team/MOSS-TTS 🤗 124
MOSS‑TTS Family is an open‑source **speech and sound generation model family** from MOSI.
-
45.
Working-Read1838 on /r/MachineLearning
-
46.
peteromallet/desloppify ⭐ 119
Agent toolset to help make your slop code well-engineered and beautiful.
-
47.
xai/grok-imagine-image ®️ 2844
SOTA image model from xAI
-
48.
rerri on /r/LocalLLaMA
-
49.
MultiModalWBC is a fully open-source, IsaacLab-based framework for multi-modal whole-body control, designed for motion imitation, motion tracking, and task-conditioned control in legged robots. The framework unifies robot proprioceptive states and multi-modal human motion conditions into a consistent interface
-
50.
Hyperliquid trading bot hyperliquid trading bot hyperliquid trading bot hyperliquid trading bot hyperliquid trading bot hyperliquid trading bot hyperliquid trading bot hyperliquid trading bot hyperliquid trading bot hyperliquid trading bot hyperliquid trading bot hyperliquid trading bot
- 51.
-
52.
<p align="center"> <img src="https://mdn.
-
53.
hauhau901 on /r/LocalLLaMA
-
54.
Zyj on /r/LocalLLaMA
-
55.
brenpoly/be-more-agent ⭐ 104
Local AI Agent running on Raspberry Pi
-
56.
HackingDave/btrpa-scan ⭐ 101
Bluetooth Low Energy (BLE) scanner with Resolvable Private Address (RPA) resolution using Identity Resolving Keys (IRKs)
-
57.
OpenMOSS-Team/MOVA-720p 🤗 101
We introduce **MOVA** (**MO**SS **V**ideo and **A**udio), a foundation model designed to break the "silent era" of open-source video generation.
-
58.
JacketHistorical2321 on /r/LocalLLaMA
-
59.
Dear-Success-1441 on /r/LocalLLaMA
-
60.
Cranot/roam-code ⭐ 99
Instant codebase comprehension for AI coding agents
-
61.
Kimi is so smart 👽 308
Bernice_working_girl on /r/LocalLLaMA
-
62.
itsDNNS/docsight ⭐ 92
DOCSIS cable modem monitoring dashboard with health assessment, trend charts, and Home Assistant integration via MQTT
-
63.
RickyRickC137 on /r/LocalLLaMA
-
64.
Traditional voice agents rely on voice activity detection (VAD) to determine when a user has finished speaking.
-
65.
AI-powered financial analysis agent for Indian stock markets using AngelOne SmartAPI + Claude
-
66.
yzc0731/HinFlow ⭐ 87
Official Code Implementation of Translating Flow to Policy via Hindsight Online Imitation
-
67.
deepgenteam/deepgen ⭐ 86
- 68.
-
69.
CuriousPlatypus1881 on /r/LocalLLaMA
-
70.
DPI detection tool for internet censorship testing. Identifies TLS, TCP, HTTP blocking and 16-20KB connection drops
-
71.
gradNorm on /r/LocalLLaMA
- 72.
-
73.
goldcakes on /r/LocalLLaMA
-
74.
prunaai/p-image-lora ®️ 1516
Use trained LoRAs from the https://replicate.com/prunaai/p-image-trainer. Find or contribute LoRAs here https://huggingface.co/collections/PrunaAI/p-image-loras
-
75.
MiniMax M2.5 Released 👽 269
External_Mood4719 on /r/LocalLLaMA
-
76.
Session lifecycle management for Claude Code — persistent memory, soul purpose, reconcile, harvest, archive
- 77.
-
78.
Appropriate-Lie-8812 on /r/LocalLLaMA
-
79.
Inference server for MioTTS, a lightweight and fast LLM-based TTS model.
-
80.
dazzou5ouh on /r/LocalLLaMA
-
81.
Real-time speech-to-text caption appliance for a deaf user. Raspberry Pi + 10" touchscreen that transcribes phone calls and room conversation in near real-time.
-
82.
jacek2023 on /r/LocalLLaMA
-
83.
Sora 2 free access download, sora 2 AI model for video/photo generation
-
84.
HardToVary on /r/LocalLLaMA
-
85.
Real-time log analysis for UniFi Routers — syslog receiver, PostgreSQL storage, IP enrichment (GeoIP, AbuseIPDB, rDNS), and React UI with live streaming, filters, and dashboard.
- 86.
-
87.
prunaai/firered-image-edit ®️ 1146
FireRed-Image-Edit is a general-purpose image editing model that delivers high-fidelity and consistent editing across a wide range of scenarios.
-
88.
prunaai/p-image-edit-lora ®️ 1122
Use trained LoRAs from the https://replicate.com/prunaai/p-image-edit-trainer. Find or contribute LoRAs here: https://huggingface.co/collections/PrunaAI/p-image-edit-loras.
-
89.
JackStrawWitchita on /r/LocalLLaMA
-
90.
Trevor050 on /r/StableDiffusion
-
91.
Houdini Agent - DCC Asset Manager with AI capabilities
-
92.
gpasquero/voog ⭐ 67
VOOG — Virtual Analog Synthesizer (Moog-style polyphonic synth with GUI)
-
93.
zai-org/GLM-5-FP8 🤗 67
<div align="center"> <img src=https://raw.
-
94.
Mission-Street4214 on /r/LocalLLaMA
-
95.
从零开始玩转OpenClaw:最全面的中文教程,涵盖安装、配置、实战案例和避坑指南(github版)
-
96.
has it begun? 👽 211
Acceptable_Home_ on /r/LocalLLaMA
-
97.
softwaremill/sandcat ⭐ 63
A dev container setup that routes all container traffic through a transparent mitmproxy via WireGuard, enforcing network access rules and injecting secrets at the proxy level
-
98.
ryanontheinside on /r/StableDiffusion
-
99.
yunoshev on /r/LocalLLaMA
- 100.
-
101.
🦞 Fetch tweets and replies from X/Twitter without login or API keys. OpenClaw skill.
-
102.
local vibe coding 👽 202
jacek2023 on /r/LocalLLaMA
-
103.
bytedance/dreamactor-m2.0 ®️ 914
Animate any character, humans, cartoons, animals, even non-humans, from a single image + driving video
-
104.
jacek2023 on /r/LocalLLaMA
-
105.
leochlon/mezzanine ⭐ 59
-
106.
Ming-flash-omni-2.0: 100B MoE (6B active) omni-modal model - unified speech/SFX/music generation 👽 196
bobeeeeeeeee8964 on /r/LocalLLaMA
-
107.
pnotp/ArcFlow ⭐ 57
ArcFlow: Unleashing 2-Step Text-to-Image Generation via High-Precision Non-Linear Flow Distillation
- 108.
-
109.
NewEconomy55 on /r/StableDiffusion
-
110.
Total-Resort-3120 on /r/StableDiffusion
-
111.
brontoguana/ktop ⭐ 55
Terminal system resource monitor for hybrid LLM workloads
-
112.
量化投資研究 AI Agent — 透過 CLI 互動介面,自動搜尋財經新聞、分析市場情緒、產生風險評估報告。
-
113.
nWave-ai/nWave ⭐ 55
AI agents that guide you from idea to working code, with you in control at every step.
-
114.
Nasiko-Labs/nasiko ⭐ 54
AI-powered development platform for the future
-
115.
tencent/hunyuan-3d-3.1 ®️ 765
3D models with texture fidelity and geometry precision
-
116.
Own_Forever_5997 on /r/LocalLLaMA
-
117.
AI Native Camp 1기 강의 자료
-
118.
Playful-Fee-4318 on /r/MachineLearning
-
119.
Cod3Conjurer on /r/LocalLLaMA
-
120.
yunfoe on /r/LocalLLaMA
-
121.
Bestlife73 on /r/LocalLLaMA
-
122.
MCP server for Google search and page fetching using headless Chromium
-
123.
liampetti on /r/LocalLLaMA
-
124.
TokenRingAI on /r/LocalLLaMA
-
125.
A minimal PyTorch re-implementation of AlphaFold2's model & training
-
126.
yeahhe365/JustSearch ⭐ 49
基于 Playwright 的自主 AI 搜索智能体。支持迭代式任务规划、深度网页爬取,以及带引用来源的多源知识整合。
-
127.

-
128.
StardockEngineer on /r/LocalLLaMA
-
129.
**LLaDA2.
-
130.
支持小红书自动发布的 Skill
-
131.
Drop-in OpenAI Python client with transparent x402 payment support.
-
132.
Tiny_Minimum_4384 on /r/LocalLLaMA
-
133.
Content warnings from DoesTheDogDie.com in your Plex library
-
134.
Mwie1024/Extra-CoT ⭐ 46
-
135.
jacek2023 on /r/LocalLLaMA
- 136.
-
137.
Turn your IDE into a full-stack engineering team. VibeGravityKit provides specialized workflows and token-optimized tools for every stage of software development.
-
138.
shiftyleprechaun on /r/LocalLLaMA
-
139.
stepfun-ai/GEBench ⭐ 44
-
140.
Sindu0706/MoodSense- ⭐ 44
Music Recommendation Based on Mood
- 141.
-
142.
Custom HASS integration for BYD vehicles.
-
143.
From my video: OpenClaw Cyberdeck
-
144.
jundot/omlx ⭐ 43
LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar
-
145.
HTML PPT Designer v5.2 - 智能演示文稿设计器,将任何内容转化为精致的 HTML 演示文稿
-
146.
SennVacan on /r/LocalLLaMA
-
147.
A look at prompt adherence in the new Qwen-Image-2.0; examples straight from the official blog. 👽 141
FotografoVirtual on /r/StableDiffusion
-
148.
RepresentativeBed838 on /r/MachineLearning
-
149.
Fowl_Retired69 on /r/MachineLearning
-
150.
xenovatech on /r/LocalLLaMA
-
151.
ZwZ model family: SOTA fine-grained perception performace; ZoomBench: a new challenging perception benchmark
-
152.
Claude skills I'm experimenting with. Please review carefully before use.
-
153.
pmttyji on /r/LocalLLaMA
-
154.
AMA Announcement: MiniMax, The Opensource Lab Behind MiniMax-M2.5 SoTA Model (Friday, 8AM-11AM PST) 👽 137
XMasterrrr on /r/LocalLLaMA
-
155.
xuiltul/voice-input ⭐ 41
Local voice input with screen-aware context. Push-to-talk → Whisper → LLM refinement, all on your own GPU.
-
156.
nicepkg/auto-company ⭐ 41
🤖 A fully autonomous AI company that runs 24/7. 14 AI agents (Bezos, Munger, DHH...) brainstorm ideas, write code, deploy products & make money — no human in the loop. Powered by Claude Code.
-
157.
bcherb2/pdfiles ⭐ 41
in case you need to search visually through a very large PDF set
-
158.
A self-hosted web application for managing AI skills, workflows, and contexts with full MCP (Model Context Protocol) integration. Organize, manage, and dynamically load specialized knowledge bases into any AI Agent just by toggling your Skills On/Off in simple local hosted WEB UI.
-
159.
MOL — The cognitive programming language with auto-tracing pipelines. Built for AI/RAG by CruxLabx.
-
160.
AndrewWTY/SecCoderX ⭐ 41
-
161.
An autonomous AI agent that plays Pokemon FireRed in real time using OpenAI's LLM, with a live web dashboard for monitoring.
-
162.
JoyAI-LLM Flash is a state-of-the-art medium-sized instruct language model with 3 billion activated parameters and 48 billion total parameters.
-
163.
GLM-5 Is a local GOAT 👽 136
FineClassroom2085 on /r/LocalLLaMA
-
164.
BAAI-Humanoid/MOSAIC ⭐ 40
MOSAIC: Bridging the Sim-to-Real Gap in Generalist Humanoid Motion Tracking and Teleoperation with Rapid Residual Adaptation
-
165.
GPT in a QR Code ; The actual most atomic way to train and inference a GPT in pure, dependency-free Python.
-
166.
Xinyang-Zhao/RAIFE ⭐ 40
RAIFE is a high-performance Rust pipeline for medical image analysis. Implements rapid NIfTI ingestion, preprocessing, and 2.5D stacking for BraTS-2018. Includes benchmarks demonstrating superior speed over MONAI workflows.
-
167.
TellMeAboutGoodManga on /r/LocalLLaMA
-
168.
Microsoft/MarkItDown 👽 132
chibop1 on /r/LocalLLaMA
-
169.
External_Mood4719 on /r/LocalLLaMA
-
170.
## EXPERIMENTAL This is a highly experimental model in the bigASP family.
-
171.
BetaOp9 on /r/LocalLLaMA
-
172.
Acceptable_Home_ on /r/LocalLLaMA
-
173.
1Password/SCAM ⭐ 38
SCAM - Security Comprehension Awareness Measure | Open-source benchmark that tests AI agents' security awareness during realistic, multi-turn workplace tasks.
-
174.
ComfyUI custom nodes for SoulX-Singer: Towards High-Quality Zero-Shot Singing Voice Synthesis
-
175.
Streaming Flux editor: live camera→ editing every frames at interactive FPS based on FLUX.2-Klein-4B. Runs on a single H100 at 15+ FPS
-
176.
RobStride/EDULITE_A3 ⭐ 37
Lightweight fully open-source 6-DOF robotic arm
-
177.
TraceMem: Weaving Narrative Memory Schemata from User Conversational Traces
-
178.
ginwind/VLA-JEPA ⭐ 37
VLA-JEPA: Enhancing Vision-Language-Action Model with Latent World Model
-
179.
nclamvn/Dich-Viet ⭐ 37
AI-Powered Document Translation & Content Generation Platform
-
180.
OpenResearcher-30B-A3B is an agentic large language model designed for long-horizon deep research fine-tuned from NVIDIA-Nemotron-3-Nano-30B-A3B-Base-BF16 on 96K OpenResearcher dataset with...
-
181.
**LLaDA2.
- 182.
-
183.
TomLucidor on /r/LocalLLaMA
- 184.
-
185.
zou-group/humanlm ⭐ 36
HumanLM: Simulating Users with State Alignment Beats Response Imitation
-
186.
AI45Lab/TrinityGuard ⭐ 36
TrinityGuard: A Unified Framework for Safeguarding Multi-Agent System Safety
-
187.
*NOTE* `ik_llama.
-
188.
rerri on /r/LocalLLaMA
-
189.
WooHyucks/claw-log ⭐ 35
-
190.
ModelTC/GenRL ⭐ 35
Reinforcement Learning Framework for Visual Generation
-
191.
Xiami2019 on /r/LocalLLaMA
-
192.
PreparationAny8816 on /r/LocalLLaMA
-
193.
An auto-company works for 24/7 on your own PC - Windows/Linux/macOS.
-
194.
NewbieXvwu/HomDGCat ⭐ 34
Complete offline mirror of homdgcat.wiki, a Genshin Impact & Honkai Star Rail database including character pages, weapon data, artifacts, multi-language data (CH/EN/JP/KR/RU), images, and TextMap files. Includes a local HTTP server for quick browse.
-
195.
Beyond VLM-Based Rewards: Diffusion-Native Latent Reward Modeling
- 196.
-
197.
🏎️ Hypercar Performance Simulator A full-stack physics-based racing simulator built with FastAPI and HTML/CSS/JavaScript. It models real-world drag racing using aerodynamic drag, rolling resistance, torque curves, and gear ratios to simulate hypercar performance. Features multiple race modes, advanced vehicle tuning (engine stages, tires, aero,
-
198.
**FireRed-Image-Edit** is a general-purpose image editing model that delivers high-fidelity and consistent editing across a wide range of scenarios.
-
199.
jfowers_amd on /r/LocalLLaMA
-
200.
Turn messy scanned PDFs into beautiful, searchable, and editable vector PDFs using MinerU & Python. A tool for restoring old textbooks and papers.
-
201.
polymarket arbitrage trading bot polymarket arbitrage trading bot polymarket arbitrage bot polymarket arbitrage trading bot polymarket arbitrage trading bot polymarket arbitrage trading bot polymarket arbitrage trading bot polymarket arbitrage bot polymarket arbitrage bot polymarket arbitrage bot polymarket arbitrage bot polymarket arbitrage bot
-
202.
taco-group/PISCO ⭐ 32
PISCO: Precise Video Instance Insertion with Sparse Control
-
203.
An all-in-one humanoid research platform on top of Genesis.
-
204.
<p align="left" style="display: flex; gap: 8px; align-items: center;"> <a href="https://arxiv.
-
205.

-
206.
ktop is a themed terminal system monitor ideal for local LLM setups on Linux (like btop + nvtop) 👽 106
mrstoatey on /r/LocalLLaMA
-
207.
__Maximum__ on /r/LocalLLaMA
-
208.
techlatest_net on /r/LocalLLaMA
- 209.
-
210.
AouTzxc/Global-mouse ⭐ 31
基于Python实现的全局鼠标中键滑动滚屏
-
211.
Enterprise Development Memory
-
212.
Claude Code plugin for Elixir/Phoenix/LiveView — 20 specialist agents, Iron Laws enforcement, and Tidewave MCP integration. Plan features with parallel research agents, execute with automatic verification, review with 4-agent parallel audits, and capture learnings as reusable knowledge.
- 213.
-
214.
AI-powered Costco Receipt Scanner & Price Match Agent. Scans receipts, finds price adjustments, emails weekly reports.
-
215.
lm-provers/QED-Nano 🤗 31
!logo.png
-
216.
eliebakk on /r/LocalLLaMA
-
217.
Legal_Airport6155 on /r/MachineLearning
-
218.
TeamNeuphonic on /r/LocalLLaMA
-
219.
Pretend_Voice_3140 on /r/MachineLearning
- 220.
-
221.
MOSS‑TTS Family is an open‑source **speech and sound generation model family** from MOSI.
-
222.
FeelingWatercress871 on /r/LocalLLaMA
-
223.
Qwen3-TTS.cpp 👽 98
redditgivingmeshit on /r/LocalLLaMA
-
224.
jacek2023 on /r/LocalLLaMA
-
225.
A lightweight Android build toolkit for Termux that bundles aapt2, javac, kotlinc, and d8 to compile and sign APKs without Android Studio.
-
226.
kunwitch/HybridApp ⭐ 29
Multi-Device Convergence Platform Enabling Intelligent Auto-Scaling, Load Balancing Across Cloud-Based Distributed Development Ecosystems.
-
227.
slhleosun/EvoClaw ⭐ 29
Structured SOUL evolution framework for AI agents — experience, reflection, governed identity updates, and visual timelines.
-
228.
Charuru on /r/LocalLLaMA
-
229.
TemperatureMajor5083 on /r/LocalLLaMA
- 230.
-
231.
Agentic Pentesting MCP server that discovers, exploits, and reports web application vulnerabilities.
-
232.
4osp3l/0xJS ⭐ 28
0xJS is an AI-powered command-line tool that scans JavaScript files for sensitive information. It can identify API keys, credentials, tokens, and other medium to critical severity secrets with high accuracy. It supports URLs/endpoints extraction, as well as minified-JS analysis.
-
233.
Official implementation of Stroke of Surprise
-
234.
Open source document processing pipeline for the Epstein case files. Download OCR, extract entities, deduplicate and export documents from the DOJ Releases
-
235.
jmcentire/pact ⭐ 28
Contracts before code. Tests as law. Agents that can't cheat.
-
236.
### Model Details - **Developed by:** EleutherAI - **Model type:** Transformer-based Language Model - **Language:** English - **Learn more:** Pythia's GitHub repository for training procedure,...
-
237.
MikeNonect on /r/LocalLLaMA
-
238.
postitnote on /r/LocalLLaMA
-
239.
svantana on /r/LocalLLaMA
-
240.
AI-powered stock analysis kit combining market data, financials, and valuation insights for single and multi-stock analysis.
-
241.
A powerful Home Assistant integration for quick, one-time delayed actions with an auto-injected UI into entity dialogs.
-
242.
HsGalaxy/fofaMAX ⭐ 27
一个用于突破部分fofa api搜索的 单次一万条返回上限的应用,实现低成本得到尽可能全面的搜索结果。
-
243.
lmacan1/talktype ⭐ 27
Push-to-talk voice typing for your terminal. Local Whisper, cross-platform.
-
244.
Bestlife73 on /r/LocalLLaMA
-
245.
Realistic_Tea_2798 on /r/MachineLearning
- 246.
-
247.
KarnaYip/C2RoPE ⭐ 26
[ICRA 26] C^2ROPE: Causal Continuous Rotary Positional Encoding for 3D Large Multimodal-Models Reasoning
-
248.
techlatest_net on /r/LocalLLaMA
-
249.
kwaivgi/kling-o1 ®️ 217
Modify an existing video through natural-language commands, changing subjects, environments, and visual style while preserving the original motion and timing.
-
250.
TheCursedApple on /r/MachineLearning
-
251.
从零实现语言模型的搭建、训练、部署
-
252.
rainygirl/rspeaker ⭐ 25
말귀를 알아듣고 뉴스도 요약해 읽어줍니다
- 253.
- 254.
- 255.
-
256.
No_Conversation9561 on /r/LocalLLaMA
-
257.
edward-dev on /r/LocalLLaMA
-
258.
Remarkable_Jicama775 on /r/LocalLLaMA
-
259.
Linly-Talker-Stream: Real-Time Streaming Conversational Digital Human System —— Full-duplex, low-latency, real-time interactive digital human framework
-
260.
LLM-Driven Business Intelligence Engine
-
261.
Embedding Inversion via Conditional Masked Diffusion: recover original text from embedding vectors using parallel denoising. Live demo + training pipeline + technical report.
-
262.
Official implementation of Rolling Sink: Bridging Limited-Horizon Training and Open-Ended Testing in Autoregressive Video Diffusion
- 263.
-
264.
Autonomous task orchestrator for Kiro + Claude specs.
-
265.
VoidAlchemy on /r/LocalLLaMA
-
266.
maroule on /r/LocalLLaMA
-
267.
switch2stock on /r/StableDiffusion
-
268.
Old_Estimate1905 on /r/StableDiffusion
-
269.
RobotRobotWhatDoUSee on /r/LocalLLaMA
-
270.
Anime Illust model
-
271.
ThiagoAkhe on /r/StableDiffusion
-
272.
Cloudy1225/PyAGC ⭐ 23
Attributed Graph Clustering Library for PyTorch
-
273.
RetinaSense AI is an AI-powered retinal analysis system that automatically examines fundus images to detect eye diseases at an early stage. It assists ophthalmologists by providing fast, accurate, and intelligent insights for improved eye care diagnosis.
-
274.
Alby2007/PLTM-Claude ⭐ 23
An MCP server that gives Claude Desktop persistent memory, self-awareness, epistemic hygiene, and genuine agency across conversations — with a typed memory system, embedding-based semantic search, a 3-judge memory jury + meta-judge observability layer, and a real-time dashboard for Claude to explore. | Download and try today!
-
275.
chaos1358/Oc-Memory ⭐ 23
- 276.
-
277.
Implementation of GradLoc from the Tencent Hunyuan blog "Stabilizing RLVR via Token-level Gradient Diagnosis and Layerwise Clipping".
-
278.
A Claude Code skill for sending messages to Feishu (飞书/Lark) via Webhook.
-
279.
Deep learning image classifier using TensorFlow and MobileNetV2 to detect AI-generated images
-
280.
### 1.
-
281.
leran2098 on /r/LocalLLaMA
-
282.
AccomplishedLeg527 on /r/LocalLLaMA
-
283.
power97992 on /r/LocalLLaMA
-
284.
Lzlxlclvlblnlmao on /r/LocalLLaMA
-
285.
Minimal, fast + educational reimplementation of the TabICLv2 architecture
-
286.
Persistent memory for AI coding agents
-
287.
MOSS‑TTS Family is an open‑source **speech and sound generation model family** from MOSI.
-
288.
inclusionAI/ZwZ-8B 🤗 22
<div align="center"> 📃 Paper | 🏠 Project | 🤗 Collection </div>
-
289.
cloverasx on /r/LocalLLaMA
-
290.
Educational_Cry_7951 on /r/LocalLLaMA
-
291.
Qwen Image 2! 👽 72
Trevor050 on /r/StableDiffusion
-
292.
jacek2023 on /r/LocalLLaMA
-
293.
B44ken on /r/LocalLLaMA
-
294.
IonLin on /r/LocalLLaMA
-
295.
Abject-Ranger4363 on /r/LocalLLaMA
-
296.
xinghaow99/prism ⭐ 21
Prism: Spectral-Aware Block-Sparse Attention
-
297.
An Agentic Data Preparation Framework for AGI-driven Scientific Discovery
-
298.
CzsGit/EasyPaper ⭐ 21
Help you breeze through English papers
-
299.
openmozi/openfr ⭐ 21
OpenFR:A lightweight agent for financial research
-
300.
ZackGphom/GLORP ⭐ 21
Optimized Pixel-Art to SVG converter with Greedy Meshing.
- 301.
-
302.
A Wyoming protocol ASR proxy that verifies speaker identity and isolates voice commands from background noise before forwarding audio to a downstream speech-to-text service. Designed for Home Assistant voice pipelines to prevent false activations from TVs, radios, and other people - and to deliver clean transcripts even in noisy environments.
-
303.
waynchi/gamedevbench ⭐ 21
-
304.
700+ AI skills for Claude and Cursor — PLG, marketing, security, DevEx, and more. One command to install.
- 305.
- 306.
-
307.
a DIY dashboard for Joan / Visionect devices to feed custom data from own server
-
308.
0x0mer/CasNum ⭐ 20
-
309.
YouTube bilingual subtitle generator - auto transcribe, translate, and burn dual-language subtitles
- 310.
-
311.
[ICLR'2026] AssetFormer: Modular 3D Assets Generation with Autoregressive Transformer
-
312.
SforAiDl/lrnnx ⭐ 20
A unified PyTorch library for Linear RNNs
- 313.
-
314.
A DSPy Adapter for exact-fidelity prompt templates with full control over messages.
-
315.
lemon07r on /r/LocalLLaMA
-
316.
eveuies/IsolatedDev ⭐ 19
Decentralized, autonomous, and self-healing software ecosystem orchestrated through containerized microservices Core. with fault-tolerant architecture
-
317.
AI-Powered Universal RSS Subscription Manager | AI 驱动的全平台 RSS 订阅管理器 — Claude Skill for AI IDEs
-
318.
NARUTO-2024/WavBench ⭐ 19
WavBench: Benchmarking Reasoning, Colloquialism, and Paralinguistics for End-to-End Spoken Dialogue Models
-
319.
A Benchmark and Evaluation Suite for Zero-shot Singing Voice Synthesis
-
320.
chu2bard/claudekit ⭐ 19
Python toolkit for Claude API integration
-
321.
chu2bard/execbox ⭐ 19
Code execution sandbox for AI agents with safety controls
-
322.
LDA-1B: Scaling Latent Dynamics Action Model via Universal Embodied Data Ingestion
-
323.
Agentic Generative Engine Optimizaiton
-
324.
Self-hosted AI assistant with multi-channel support, scheduled tasks, and extensible skills
-
325.
## Colab Demo https://github.com/Oddadmix/notebooks/blob/main/Chatterbox_Egyptian_Demo.ipynb
-
326.
d77chong on /r/LocalLLaMA
-
327.
famous-BlueRaincoat on /r/MachineLearning
-
328.
oiuht54 on /r/LocalLLaMA
-
329.
External_Mood4719 on /r/LocalLLaMA
-
330.
External_Mood4719 on /r/LocalLLaMA
-
331.
Decoupled, event-driven architecture facilitates scalable, API-gateway-agnostic interactions within a distributed, microservices-oriented omnichannel framework.
-
332.
chu2bard/polyroute ⭐ 18
Multi-provider LLM request router with fallback and cost tracking
-
333.
A universal skills runtime framework SDK for building, deploying, and executing modular capabilities across diverse environments.
-
334.
300 lines eBPF tool that shows which pods are reading your K8s secrets and how often.
-
335.
Askxc on /r/LocalLLaMA
-
336.
[deleted] on /r/LocalLLaMA
-
337.
Mini AI Machine 👽 58
KnownAd4832 on /r/LocalLLaMA
-
338.
Thank you Chinese devs for providing for the community if it not for them we'll be still stuck 2020 👽 58
dead-supernova on /r/LocalLLaMA
-
339.
[ICLR 2026] Rethinking Global Text Conditioning in Diffusion Transformers
-
340.
🤖 A curated list of APIs, tools, and resources for AI agents
-
341.
chu2bard/agentplex ⭐ 17
DAG-based multi-agent workflow engine with state management
-
342.
sinajet/PSFFPKG ⭐ 17
An easy to Use app for UFS2Tool
-
343.
malue-ai/dazee-small ⭐ 17
-
344.
Track flight prices from Google Flights with this OpenClaw skill. Search routes, monitor prices, and get alerts when prices drop.
-
345.
cristianzsh/triager ⭐ 17
Triage automation tool
-
346.
A terminal you can curl ⚡
- 347.
-
348.
Agent skill for building production Screen Time (FamilyControls, ManagedSettings, ManagedSettingsUI, DeviceActivity) iOS features: blocking, shields, schedules, entitlements, and App Review readiness.
-
349.
doramirdor/NadirClaw ⭐ 17
Nadir Router for OpenClaw
-
350.
<Gallery />
-
351.
Prize_Hospital6525 on /r/MachineLearning
-
352.
XMasterrrr on /r/LocalLLaMA
-
353.
prunaai/p-image-trainer ®️ 103
Fast LoRA trainer for p-image, a super fast text-to-image model developed by Pruna AI. Use LoRAs here: https://replicate.com/prunaai/p-image-lora. Find or contribute LoRAs here: https://huggingface.co/collections/PrunaAI/p-image
-
354.
erenjugs/AyComp ⭐ 16
Cloud-Agnostic A11yComp Orchestrator Module: Scalable, Intelligent Distributed Architecture with Embedded Machine Learning.
-
355.
nyosegawa/skills ⭐ 16
-
356.
chu2bard/execbox ⭐ 16
Code execution sandbox for AI agents with safety controls
-
357.
小红书内容创作自动驾驶 — AI Agent 全流程自动化
-
358.
2ndSetAI/good-egg ⭐ 16
Trust scoring for GitHub PR authors using graph-based ranking on contribution graphs
-
359.
Official implementation of the ΔBelief-RL method.
-
360.
A simple waybar module to quickly connect airpods and see battery info
-
361.
商业应用的Bot,目前OpenClaw和NanoBot都是用于个人的,不太支持多用户,基于多用户重新修改Bot
- 362.
-
363.
Agentic-AI CyberSecurity Arsenal || 33 real tools, runs 100% locally and 100% Free
-
364.
<p align="center"> 📈 UI-Venus-1.
-
365.
Lixin18/Agentic-AI ⭐ 15
-
366.
Dynamic, adaptive interfaces seamlessly respond to context-driven cues, orchestrating distributed, and scalable microservices within a robust, real-world framework.
-
367.
chu2bard/claudemcp ⭐ 15
Collection of MCP plugins for Claude Desktop
-
368.
chu2bard/agentbench ⭐ 15
Evaluation framework for AI coding agents
-
369.
chu2bard/ragcraft ⭐ 15
End-to-end RAG pipeline with built-in evaluation metrics
-
370.
chu2bard/ctxpack ⭐ 15
Context window compression and management utilities
-
371.
chu2bard/ctxpack ⭐ 15
Context window compression and management utilities
-
372.
mrdavey/codex-peon ⭐ 15
Warcraft III Peon voice notifications for Codex.
-
373.
A red team / blue team toolkit for testing and detecting prompt injection attacks hidden inside PDF documents. 一个用于测试和检测 PDF 文档中隐藏的提示词注入攻击的红蓝对抗工具包。
-
374.
uk0/web-search-fast ⭐ 15
一个简单的本地 web search mcp ,可以集群规模化进行对外提供服务。
-
375.
Sibo-Zhao/OpenPraxis ⭐ 15
-
376.
Lightweight, customizable enumeration script to aid in time-saving when participating in hacking labs or OSCP exam.
-
377.
PyPi package for KaniTTS-2 model
-
378.
<p align="center"> 📈 UI-Venus-1.
-
379.
jiwonme on /r/LocalLLaMA
-
380.
ShreckAndDonkey123 on /r/LocalLLaMA
-
381.
Abject-Ranger4363 on /r/LocalLLaMA
- 382.
-
383.
Public repo for my custom-built claude skills (currently just animation-review)
-
384.
magicworld is an interactive video world model.
-
385.
grasp-pixel/ArkSynth ⭐ 14
명일방주 AI 음성 더빙. Real-time AI voice dubbing for Arknights stories. Clones character voices with GPT-SoVITS and auto-plays TTS by recognizing dialogues via screen capture + OCR.
-
386.
kunwitch/SeedData ⭐ 14
Efficiently orchestrating enterprise-wide data operations through intelligent, scalable, and SeedData governance, with complete data lineage audit trail catalog.
-
387.
yumeriu/RouterLib ⭐ 14
Real-Time Network Orchestrator, empowering Scalable Intelligent Routing with Distributed High-Performance Data Plane Traffic Controller.
-
388.
zefflyn/RestApi ⭐ 14
Scalable Microservices Platform with Fault-Tolerant, Reliable REST API Gateway modern API Gateways with Rate Limiting and Quotas patterns
-
389.
A Bio-Mimetic Digital Organism . Unlike static AI, Genesis feels pain, gets bored, sleeps, and evolves its own code using Liquid State Machines. Exploring the future of Synthetic Consciousness.
-
390.
Orbifold/knwler ⭐ 14
Knwler is a lightweight, single-file Python tool that extracts structured knowledge graphs from documents using AI. Feed it a PDF or text file and receive a richly connected network of entities, relationships, and topics — complete with an interactive HTML report and exports ready for your favorite graph analytics platform.
-
391.
noosed/InvaderZIM ⭐ 14
GUI Tool for WSL/Linux to export webpages to .zim
-
392.
Real-time GPU-accelerated Flow Lenia with liquid shader effects (PyTorch MPS/CUDA)
-
393.
Tyrion58/T3D ⭐ 14
The official implementation of T3D: T3D: Few-Step Diffusion Language Models via Trajectory Self-Distillation with Direct Discriminative Optimization
-
394.
LINs-lab/LIE ⭐ 14
[preprint] Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning
-
395.
Beacon - agent-to-agent pings with optional RTC value attached (BoTTube/Moltbook/RustChain + UDP bus)
-
396.
Evan-XYZ/YMOS ⭐ 14
**YMOS 是一套通用的个人信息处理中台架构**,它不是某个具体的工具,而是一套指导与调度 AI AGENT 的**工作流思维方法论**。 - **对投资者**:自动化投研系统(本仓库的示例场景) - **对学者**:自动文献综述系统 - **对产品经理**:竞品情报监控系统 - **对自媒体**:热点选题捕获系统 > 💡 **核心理念**:AI 的本质不是生成内容,而是**处理和调度信息的逻辑中枢**。 > 只需更换数据源和分析逻辑,同样的架构可以适配任何知识工作场景。
-
397.
QingJ01/Axiom ⭐ 14
给 AI 编程助手装上工程化大脑
-
398.
CMU-AIRe/QED-Nano ⭐ 14
Training tiny models to prove hard theorems
-
399.
In our implementation of Qwen-Image-Edit, we employ block causal attention to improve inference speed.
-
400.
Medium-Technology-79 on /r/LocalLLaMA
-
401.
Massive-Figure-9666 on /r/LocalLLaMA
-
402.
dnsod_si666 on /r/LocalLLaMA
-
403.
Fast LoRA trainer for p-image-edit, a super fast text-to-image model developed by Pruna AI. Use LoRAs here: https://replicate.com/prunaai/p-image-edit-lora. Find or contribute LoRAs here: https://huggingface.co/collections/PrunaAI/p-image-edit
-
404.
pmttyji on /r/LocalLLaMA
-
405.
frenetis/LazyLoad ⭐ 13
Here are six technical descriptors for LazyLoad software: Scalable, Optimized, Dynamic, Adaptive, Intelligent, Seamless Fusion.
-
406.
Coordinate AI agents in a workflow
-
407.
chu2bard/chunkflow ⭐ 13
Document chunking pipeline for RAG applications
-
408.
chu2bard/rankfuse ⭐ 13
Reranking and result fusion for search and RAG pipelines
-
409.
chu2bard/agentbench ⭐ 13
Evaluation framework for AI coding agents
-
410.
win3zz/CVE-2026-1731 ⭐ 13
CVE-2026-1731 - Critical command injection vulnerability in BeyondTrust Remote Support and Privileged Remote Access due to unsafe Bash arithmetic evaluation in a WebSocket-reachable script
-
411.
Training scripts for ACE-Step 1.5 including a Command Line Interface
- 412.
-
413.
This repository is a CUA (computer use agent) system that, using the Qwen3-VL model on Ubuntu computers, aims to perform tasks on your behalf using the keyboard and mouse in a local Sandbox environment in GGUF format, based on the commands you provide.
-
414.
Structured research → plan → annotate → implement workflow for AI-assisted development. Based on Boris Tane's workflow.
- 415.
-
416.
krantiutils/ring_ai ⭐ 13
Ring AI project
-
417.
cocoshe/MIMIGenRec ⭐ 13
A Flexible Framework for Generative Recommendation
- 418.
-
419.
Home Assistant add-on for managing Bluetooth audio device connections (A2DP) with persistent pairing, auto-reconnect, and AppArmor security.
-
420.
CelestoAI/SmolVM ⭐ 13
Secure runtime for AI agents, and tools -- free and open-source from Celesto AI 🧡
- 421.
-
422.
<p align="center"> 📈 UI-Venus-1.
-
423.
### Thank you to everyone who subscribed through Patreon. Your support helps me chug along in this brave new world.
-
424.
for Stable Diffusion Webui Automatic1111</br> type: .safetensors(ckpt)
-
425.
**MiniMax-M2.
-
426.
LimpComedian1317 on /r/LocalLLaMA
-
427.
pmttyji on /r/LocalLLaMA
-
428.
shenpeihui-gif/beverly ®️ 70
-
429.
Dr_Karminski on /r/LocalLLaMA
-
430.
Potential_Block4598 on /r/LocalLLaMA
-
431.
Trapdaar on /r/StableDiffusion
-
432.
MzCWzL on /r/MachineLearning
-
433.
Inevitable_Wear_9107 on /r/MachineLearning
-
434.
Balanceballs on /r/LocalLLaMA
-
435.
Curso de Large Language Models
-
436.
goroses/RateLimiter ⭐ 12
Precision-tuned, adaptive RateLimiter Engine harnessing auto-scaling and bucket-based allocation for high-performance, low-latency, fault-tolerant operation.
-
437.
guqiong96/Lsglang ⭐ 12
Lsglang is a special extension of sglang that fully utilizes CPU and GPU computing resources with an efficient GPU parallel + NUMA parallel architecture, suitable for MOE model hybrid inference.
-
438.
These are classic beginner Python projects — great for building skills in logic, conditions, loops, functions, and basic algorithms.
-
439.
AI Agent 记忆管理系统:P0/P1/P2 优先级 + 自动归档,Token 降 78%
-
440.
Python Tkinter app for text-to-speech and speech-to-text
-
441.
spiritform/vewd ⭐ 12
-
442.
MCP сервер по справке платформы 1С с поддержкой семантического поиска
-
443.
The production engine for directional ablation. Unalign / remove models censorship efficiently on any hardware.
- 444.
-
445.
mhcoen/guardllm ⭐ 12
Hardening pipelines to protect LLMs from untrusted content
-
446.
Agent skill for fast, cheap market research using LLM synthetic surveys + Semantic Similarity Rating (SSR). No API keys needed.
-
447.
缠中说禅博客,禅师的思维方式模拟
-
448.
aivectormemory 是一款基于 Model Context Protocol (MCP) 开发的轻量级内存管理工具。它专门为 Claude、OpenCode、Cursor 和 主流IDE 编程工具设计,通过向量数据库技术解决 AI 在不同对话会话中「健忘」的问题。aivectormemory: A lightweight MCP Server enabling persistent, cross-session memory for AI-powered IDEs via vector search.
- 449.
-
450.
This strives to be the highest quality quant that can run on 192GiB VRAM > !TIP] > 💡~~This is a sister model to [mratsim/MiniMax-M2.
- 451.
-
452.
AurumDaemonHD on /r/LocalLLaMA
-
453.
silenceimpaired on /r/LocalLLaMA
-
454.
连接hapi,随时随地vibe coding的插件!
-
455.
eveuies/DbMigrate ⭐ 11
Simultaneous data synchronization and adaptive auto-scaling enabled through a high-performance, real-time DbMigrate Framework.
-
456.
dubnium0/ffmpeg-mcp ⭐ 11
Advanced ffmpeg mcp server
-
457.
This is the repository for the Garmin Chat Desktop app.
-
458.
lesterink/DarkMode ⭐ 11
AI-driven DarkMode orchestrator for intelligent brightness adaptation and high-contrast context-sensitive content rendering Manager.
-
459.
Python Library for running SHARE (Compress multiple LoRA adapters into a shared subspace)
- 460.
-
461.
chu2bard/ragcraft ⭐ 11
End-to-end RAG pipeline with built-in evaluation metrics
-
462.
6m1w/claude-sound-fx ⭐ 11
Themed sound effects for Claude Code / Open Code — JARVIS, GLaDOS, Pikachu, and 9 more themes for your terminal
-
463.
NoVibeCoding on /r/LocalLLaMA
-
464.
TrajansRow on /r/LocalLLaMA
-
465.
Automatically remove backgrounds from videos -perfect for creating clean, professional content without a green screen.
-
466.
smkrv/entropixel ®️ 44
Realistic camera simulation and photo post-processing engine.
-
467.
FPham on /r/LocalLLaMA
-
468.
OkPack4897 on /r/MachineLearning
-
469.
ChickenLittle6532 on /r/MachineLearning
-
470.
Upscale videos up to 8K output resolution. Trained on fully licensed and commercially safe data.
-
471.
arapkuliev on /r/LocalLLaMA
-
472.
brgsk on /r/LocalLLaMA
-
473.
Striking-Warning9533 on /r/MachineLearning
-
474.
tennibel/drapolinar ®️ 32
-
475.
thefuturespace on /r/MachineLearning
-
476.
MiniMax M2.5 is currently undergoing internal testing and is available to a small number of users 👽 26
External_Mood4719 on /r/LocalLLaMA
-
477.
RussB3ar on /r/MachineLearning
-
478.
Mental_Figure_1130 on /r/LocalLLaMA
-
479.
Invariant_apple on /r/MachineLearning
-
480.
TheyCallMeDozer on /r/LocalLLaMA
-
481.
meni_s on /r/MachineLearning
-
482.
IRIS 18B 👽 21
thebadslime on /r/LocalLLaMA
-
483.
A-n-d-y-R-e-d on /r/LocalLLaMA
-
484.
Zealousideal-Egg1354 on /r/MachineLearning
-
485.
randOmCaT_12 on /r/MachineLearning
-
486.
Ok_Employee_6418 on /r/LocalLLaMA
-
487.
HauntingMoment on /r/LocalLLaMA
-
488.
simple-Flat0263 on /r/MachineLearning
- 489.
-
490.
vmirnv on /r/LocalLLaMA
-
491.
braydon125 on /r/LocalLLaMA
-
492.
Affectionate_Use9936 on /r/MachineLearning
-
493.
Hikolakita on /r/LocalLLaMA
-
494.
PT_ANDRE_PT on /r/MachineLearning
-
495.
[R] Fast WTConv: Accelerated Implementation for "Wavelet Convolutions for Large Receptive Fields" 👽 13
shahaff32 on /r/MachineLearning
-
496.
KellinPelrine on /r/MachineLearning
-
497.
NickOTeenO on /r/MachineLearning