Explore Projects

Discover 3,485 open source projects

Active filters (1):
Search: dataร—
Clear all

Showing 2641-2660 of 3,485 projects

juliasilge/tidy-text-mining

A manuscript for a book on tidy text mining with R, a popular data analysis language.

1.4K
Experimental
TeX
Data & Databases
Books & Guides
#text-mining#r#tidyverse

crazyhottommy/getting-started-with-genomics-tools-and-resources

A collection of Unix, R, and Python tools for bioinformatics and data science projects.

1.4K
Stable
Shell
Databases
CLI Tools
#bioinformatics#data-science#genomics

francisrstokes/construct-js

A TypeScript library for creating byte-level data structures for binary data manipulation.

1.4K
Archived
TypeScript
API Clients & Testing
API Frameworks
TypeScript
#binary#byte#data-structures

NanoNets/docstrange

An intelligent document parsing tool that extracts and converts data from various document formats to structured data like Markdown, JSON, CSV, and HTML.

1.4K
Stable
Python
LLM Wrappers & SDKs
API Frameworks
Python
#ocr#pdf-parser#document-parsing

locationtech/geotrellis

GeoTrellis is a geographic data processing engine for high performance applications.

1.4K
Active
Scala
React
#geographic data processing#high performance applications#GeoTrellis engine

HumanSignal/Adala

Adala is an autonomous data labeling agent framework for building AI applications with GPT-4 and other AI tools.

1.4K
Active
Python
Agents & Orchestration
LLM Frameworks
Python
#agent-based-framework#autonomous-agents#gpt-4

sensorsdata/sa-sdk-android

A lightweight Android SDK for data collection and tracking, including codeless tracking and visualization.

1.4K
Active
Java
Analytics & Tracking
#analytics#tracking#codeless-tracking

Deeksha2501/Data-Structures-and-Algorithms-Notes

A collection of notes on data structures and computer science fundamentals for developer interviews.

1.4K
Archived
Tutorials & Courses
Interview Prep
#data-structures#algorithms#computer-science

taglib/taglib

TagLib is a C++ library for reading and writing audio metadata, supporting multiple formats.

1.4K
Active
C++
API Frameworks
#audio#metadata#tags

spatie/laravel-searchable

A pragmatic Laravel package for searching through models and other data sources.

1.4K
Stable
PHP
API Frameworks
Search
Laravel
#search#laravel#models

AnghelLeonard/Hibernate-SpringBoot

A collection of best practices for Java persistence performance in Spring Boot applications

1.4K
Archived
Java
API Frameworks
ORMs & Query Builders
Spring Boot
#hibernate#spring-data#sql-performance

sharksforarms/deku

Deku is a Rust crate that provides a declarative API for reading and writing binary data at the bit level.

1.4K
Active
Rust
API Frameworks
CLI Tools
#bits#bytes#declarative

aio-libs/aiokafka

An asyncio client for Apache Kafka, a distributed streaming platform for building real-time data pipelines and streaming apps.

1.4K
Active
Python
Realtime
Caching
#kafka#streaming#data-pipelines

react-component/table

A feature-rich and highly customizable React table component for building complex data tables.

1.4K
Active
TypeScript
Component Libraries (React)
React
#react-component#data-table#customizable

samwafgo/SamWaf

SamWaf is a lightweight, open-source web application firewall for small companies, studios, and personal websites.

1.4K
Active
Go
API Frameworks
Security Research
#web-security#firewall#lightweight

adleroliveira/dreamjs

A lightweight JSON data generator for creating mock data in development and testing.

1.4K
Archived
JavaScript
General Utilities
Node
#data-generation#mock-data#testing

aws-cloudformation/cloudformation-guard

A policy-as-code DSL to validate CloudFormation, Kubernetes, and Terraform configurations against custom rules.

1.4K
Active
Rust
Infrastructure as Code
CLI Tools
#cloudformation#policy-as-code#compliance

B16f00t/whapa

A Python-based toolset for parsing and analyzing WhatsApp chat data for forensic analysis.

1.4K
Stable
Python
API Frameworks
CLI Tools
#whatsapp#forensics#encryption

nilearn/nilearn

A Python library for machine learning on neuroimaging data, providing a high-level API for brain imaging analysis.

1.4K
Active
Python
Machine Learning Ops
Databases
Python
#brain-imaging#neuroimaging#machine-learning

durable-streams/durable-streams

An open protocol for real-time data synchronization between server and client applications.

1.4K
Active
TypeScript
Realtime
API Clients & Testing
React
#real-time#data-synchronization#streaming
1...132134...175

Stay in the loop

Get weekly updates on trending AI coding tools and projects.