Explore Projects

Discover 158 open source projects

Active filters (1):
Search: indexing×
Clear all

Showing 1-20 of 158 projects

fighting41love/funNLP

Comprehensive Chinese NLP resource collection for developers

79.2K
Archived
Python
LLM Frameworks
RAG & Vector
Python
#nlp#chinese-nlp#ai-resources

unionlabs/union

Zero-knowledge bridging protocol for blockchain interoperability and DeFi

74.3K
Active
Rust
DeFi
Containerization
Cosmos SDK
#blockchain#zero-knowledge#interoperability

pathwaycom/llm-app

AI pipelines for RAG, enterprise search, and document indexing with real-time data sync

56.2K
Stable
Jupyter Notebook
RAG & Vector
LLM Frameworks
Jupyter Notebook
#ai-pipelines#rag#llm

run-llama/llama_index

LLamaIndex framework for building LLM-powered agents

47.4K
Active
Python
Agents & Orchestration
Desktop Model Runners
LLM Frameworks
#llamaindex#agents#fine-tuning

9001/copyparty

Portable file server with WebDAV, SFTP, FTP, TFTP, and media indexing

43.0K
Active
Python
File Storage & Upload
Python
#file-server#webdav#sftp

paperless-ngx/paperless-ngx

Document management system for scanning, indexing, and archiving documents

37.1K
Active
Python
Collaboration & Real-time
Documentation
Django
#document-management#ocr#machine-learning

grafana/loki

Loki is a log aggregation system inspired by Prometheus, designed for cost-effective, easy-to-operate logging with label-based indexing.

27.7K
Active
Go
Monitoring
Grafana
#logging#cloudnative#grafana

NirDiamant/RAG_Techniques

Advanced RAG techniques for enhanced AI systems

25.8K
Active
Jupyter Notebook
RAG & Vector
Tutorials & Courses
LangChain
#ai#rag#langchain

langfuse/langfuse

LLM engineering platform for observability, evaluation, and prompt management

22.7K
Active
TypeScript
LLM Frameworks
LLM Wrappers & SDKs
LangChain
#llm-observability#llm-evaluation#prompt-management

valeriansaliou/sonic

Fast, lightweight search backend alternative to Elasticsearch

21.2K
Active
Rust
Search
#search-engine#rust#search-server

VectifyAI/PageIndex

A document indexing and retrieval system for large language models (LLMs) and reasoning-based RAG models.

20.5K
Active
Python
LLM Frameworks
Python
#llm#rag#retrieval

index-tts/index-tts

An efficient zero-shot text-to-speech system with fine-grained control over the generated voice.

19.1K
Stable
Python
AI Voice & Speech
Python
#text-to-speech#zero-shot#voice-cloning

subquery/subql

An open-source data indexing framework for building decentralized web3 applications.

18.9K
Active
TypeScript
GraphQL
TypeScript
#web3#decentralized#indexing

comet-ml/opik

A comprehensive library for debugging, evaluating, and monitoring LLM applications, RAG systems, and agentic workflows.

18.0K
Active
Python
LLM Frameworks
Python
#llm-evaluation#llm-observability#llm-monitoring

Jackett/Jackett

Jackett is an API server that acts as a proxy for your favorite torrent trackers, making them searchable from any app.

15.0K
Active
C#
API Clients & Testing
#torrent#indexer#proxy

blevesearch/bleve

A modern text/numeric/geo-spatial/vector indexing library for Go developers.

11.0K
Active
Go
API Frameworks
#search#indexing#text-processing

yichuan-w/LEANN

A Python library for efficient RAG (Retrieval-Augmented Generation) applications with AI-powered vector database and private storage.

10.3K
Active
Python
LLM Frameworks
React
#vector-database#private-storage#RAG-applications

johnkerl/miller

Miller is a powerful CLI tool for processing tabular data like CSV, TSV, and JSON, similar to awk, sed, and other Unix utilities.

9.8K
Active
Go
CLI Tools
#csv#json#data-processing

bchavez/Bogus

A simple fake data generator for .NET that helps developers quickly create realistic test data.

9.6K
Stable
C#
Validation
#data-generator#fake-data#testing

tidwall/tile38

Real-time geospatial and geofencing database for location-aware applications and services.

9.6K
Active
Go
API Frameworks
#geospatial#geofencing#location
2...8

Stay in the loop

Get weekly updates on trending AI coding tools and projects.