Explore Projects

Discover 3,485 open source projects

Active filters (1):
Search: data

Showing 501-520 of 3,485 projects

igorbarinov/awesome-data-engineering

A curated list of data engineering tools for software developers, not focused on AI coding tools.

8.3K
Active
Databases
CLI Tools
#data-engineering #tools #awesome-list

louis-e/arnis

Generates detailed Minecraft locations using OpenStreetMap data

8.3K
Active
Rust
Tauri
#Minecraft #OpenStreetMap #Rust

boostorg/boost

Boost is a popular collection of C++ libraries that provides a wide range of utilities and data structures.

8.3K
Active
HTML
Backend Frameworks
C++
#c++ #utility-library #data-structures

gonum/gonum

Gonum is a set of numeric libraries for the Go programming language, providing tools for data analysis, scientific computing, and more.

8.3K
Active
Go
API Frameworks
Databases
#data-analysis #matrix #graph

pentaho/pentaho-kettle

Pentaho Data Integration (ETL) is a Java-based tool for building data integration and ETL pipelines.

8.3K
Active
Java
ETL & Pipelines
#etl #data-integration #pentaho

CVHub520/X-AnyLabeling

Effortless data labeling with AI support from Segment Anything and other powerful models.

8.3K
Active
Python
Computer Vision
ML Ops
#artificial-intelligence #computer-vision #image-annotation

gyroflow/gyroflow

Video stabilization tool that uses gyroscope data, written in Rust.

8.3K
Active
Rust
API Frameworks
CLI Tools
#video-stabilization #gyroscope #fps

mark3labs/mcp-go

A Go implementation of the Model Context Protocol (MCP) for integrating LLM apps with external data and tools.

8.3K
Active
Go
MCP Frameworks
LLM Frameworks
#llm #model-context-protocol #integration

CASIA-LMC-Lab/FastSAM

Fast Segment Anything (FastSAM) is a Python library for efficient image segmentation.

8.3K
Archived
Python
React
#segmentation #data-processing #efficient-data-management

apify/crawlee-python

Crawlee is a powerful web scraping and browser automation library for Python to build reliable crawlers.

8.2K
Active
Python
API Clients & Testing
Backend Frameworks
Playwright
#web-scraping #crawling #automation

jupyterhub/jupyterhub

A multi-user server for Jupyter notebooks, enabling collaborative data science and AI development.

8.2K
Active
Python
MCP Servers
Databases
#jupyter #notebooks #multi-user

kangvcar/InfoSpider

INFO-SPIDER is an open-source web scraping toolkit that helps users retrieve their data from sources such as email, e-commerce, and social platforms.

8.2K
Active
Python
Backend Frameworks
ETL & Pipelines
#web-scraping #data-extraction #open-source

jivoi/awesome-ml-for-cybersecurity

A curated list of machine learning techniques and resources for cybersecurity professionals.

8.2K
Archived
Machine Learning Ops
Security Research
#cybersecurity #data-mining #machine-learning

hediet/vscode-debug-visualizer

An extension for VS Code that visualizes data during debugging, helping developers understand complex data structures.

8.2K
Experimental
TypeScript
Charts & Visualization
IDE Extensions
VS Code
#visualization #debugging #vscode-extension

EndlessCheng/codeforces-go

A library of algorithm templates and solutions for competitive programming on Codeforces in Go.

8.2K
Active
Go
Coding Challenges
CLI Tools
#algorithms #competitive-programming #codeforces

alibaba/otter

Otter is a distributed database synchronization system from Alibaba that replicates data across data centers.

8.1K
Archived
Java
API Frameworks
Databases
#data-replication #distributed-system #database-sync

jackzhenguo/python-small-examples

A collection of Python code examples and tutorials for data science, machine learning, and web development.

8.1K
Archived
Python
Data Science
Backend Frameworks
#data-science #machine-learning #web-development

krahets/LeetCode-Book

A collection of coding solutions and interview preparation resources for LeetCode, algorithms, and data structures in Java, Python, and C++.

8.1K
Stable
Java
Coding Challenges
Books & Guides
#algorithms #data-structures #coding-challenges

lmcinnes/umap

UMAP is a dimension reduction library that can be used for visualization, exploration, and analysis of high-dimensional datasets.

8.1K
Active
Python
Dimensionality Reduction
ML Ops
#dimensionality-reduction #machine-learning #topological-data-analysis

mockoon/mockoon

Mockoon is an open-source, TypeScript-based mock API tool for local development and prototyping.

8.1K
Active
TypeScript
API Mocking
React
#mocking-server #open-source #local-development
