Explore Projects

Discover 125 open source projects

Active filters (1):
Search: data-analysisร—
Clear all

Showing 81-100 of 125 projects

WenjieDu/PyPOTS

A Python toolkit for building machine/deep learning models on partially-observed time series data

2.0K
Active
Python
Machine Learning Ops
Databases
PyTorch
#time-series#anomaly-detection#forecasting

CannyLab/tsne-cuda

GPU-accelerated t-SNE library for data visualization and dimensionality reduction.

1.9K
Archived
Cuda
ML Ops
Data Visualization
Python
#data-visualization#dimensionality-reduction#gpu-acceleration

bellingcat/octosuite

Terminal-based toolkit for analyzing GitHub data for OSINT and data analysis purposes.

1.9K
Active
Python
CLI Tools
Data Analysis
Python
#github-analysis#osint#data-analysis

h2oai/datatable

A high-performance, memory-efficient Python data analysis library for handling large datasets.

1.9K
Experimental
C++
Databases
CLI Tools
Python
#data-analysis#performance#memory-efficient

apachecn/python_data_analysis_and_mining_action

This Python repository contains code examples and notes for data analysis and mining.

1.8K
Archived
Python
Data Analysis
Tutorials & Courses
#data-analysis#data-science#python3

404notf0und/AI-for-Security-Learning

An open-source project that explores the use of AI techniques in security-related tasks, including data analysis and algorithm development.

1.8K
Archived
ML Ops
Security Research
#security#data-analysis#machine-learning

microsoft/responsible-ai-toolbox

A suite of tools that enable developers to build and monitor AI systems more responsibly.

1.7K
Active
TypeScript
Explainability
Fairness
TypeScript
#responsible-ai#explainable-ai#fairness-ai

Litlyx/litlyx

An open-source, self-hostable analytics platform built with TypeScript and Next.js that provides a simple, AI-powered dashboard for tracking website metrics.

1.7K
Stable
TypeScript
Analytics & Tracking
Full-Stack Frameworks
Next.js
#analytics#data-visualization#self-hostable

ptyadana/SQL-Data-Analysis-and-Visualization-Projects

This GitHub repository contains SQL data analysis and visualization projects using various tools and databases.

1.7K
Archived
Jupyter Notebook
Databases
ETL & Pipelines
#sql#data-analysis#data-visualization

jadianes/spark-py-notebooks

Apache Spark and Python tutorials for big data analysis and machine learning as Jupyter notebooks.

1.7K
Archived
Jupyter Notebook
Databases
ETL & Pipelines
Jupyter Notebook
#big-data#data-analysis#data-science

datageartech/datagear

DataGear is a data visualization and business intelligence platform that allows developers to build custom dashboards.

1.7K
Active
Java
Charts & Visualization
Databases
React
#data-visualization#business-intelligence#charts

starpig1129/DATAGEN

DATAGEN is an AI-driven multi-agent research assistant that automates hypothesis generation, data analysis, and report writing.

1.6K
Active
Python
Agents & Orchestration
LLM Frameworks
LangChain
#agent#ai#data-analysis

justmarkham/DAT8

General Assembly's 2015 Data Science course covering topics like machine learning, data analysis, and data visualization.

1.6K
Archived
Jupyter Notebook
Tutorials & Courses
Jupyter Notebook
#data-analysis#data-science#machine-learning

re-data/re-data

A data quality and observability tool for monitoring and fixing data issues before they become problems.

1.6K
Archived
HTML
ETL & Pipelines
CLI Tools
dbt
#data-quality#data-observability#data-monitoring

capitalone/DataProfiler

A Python library for extracting schema, statistics, and entities from datasets, useful for data profiling and privacy analysis.

1.5K
Stable
Python
ETL & Pipelines
CLI Tools
Python
#data-profiling#data-analysis#privacy

ecmadao/hacknical

Hacknical is a GitHub user-focused tool for creating better resumes by analyzing GitHub contributions and activity.

1.5K
Stable
JavaScript
Component Libraries (React)
Data Analysis
React
#github-analysis#resume-builder#data-visualization

hi-primus/optimus

Agile data preparation workflows made easy with popular Python data science libraries.

1.5K
Archived
Python
ETL & Pipelines
API Frameworks
#big-data-cleaning#data-analysis#data-cleaning

nubank/fklearn

fklearn: A functional machine learning library for Python.

1.5K
Experimental
Jupyter Notebook
React
#machine learning#python#data analysis

sepandhaghighi/pycm

A Python library for creating multi-class confusion matrices, useful for evaluating machine learning models.

1.5K
Active
Python
ML Ops
Data Analysis
#accuracy#classification#confusion-matrix

DataBrewery/cubes

A lightweight Python OLAP framework for multi-dimensional data analysis and reporting.

1.5K
Archived
Python
ORMs & Query Builders
Databases
#olap#data-analysis#multidimensional-data

Stay in the loop

Get weekly updates on trending AI coding tools and projects.