Showing 41-60 of 123 projects
A powerful multimodal AI model for real-time vision and speech interaction, built for developers who work with AI tools.
An end-to-end multimodal SVG generator that leverages pre-trained Vision-Language Models to create complex and detailed SVGs.
PartCrafter is a 3D mesh generation tool that uses compositional latent diffusion transformers to create structured 3D objects.
Scrapes Twitter API search results and user profiles with authorization support.
A GitHub repository with a list of 2025 & 2026 new grad full-time roles in SWE, Quant, and PM.
An open-source implementation of a Flow Matching Model for training AI agents via online reinforcement learning.
A crowdsourced list of Canadian tech companies hiring interns and new grads for 2025
A comprehensive paper list and resource repository for Embodied AI research and development.
A latent diffusion transformer for generating high-quality videos, suitable for vibe coders building AI-powered applications.
A robotics research project focused on aligning simulation and real-world physics for learning agile humanoid whole-body skills.
Magma is a foundation model for building multimodal AI agents, enabling next-gen AI applications.
OminiControl is a minimal and universal control system for diffusion transformer models like DALL-E and Stable Diffusion.
A powerful multimodal transformer for combining language, vision, and other modalities in AI applications.
LongWriter is a fine-tuned large language model (LLM) that can generate high-quality long-form text of 10,000+ words from long-form context.
A research project from Facebook that explores multimodal AI models for computer vision and language tasks.
SpringBlade is a commercial-grade microservices platform built with Spring Boot 3.5 and Spring Cloud 2025.
A PyTorch-based library for consistent depth estimation in super-long videos using transformers.
A comprehensive project covering AI-powered coding tools, MCP frameworks, and backend-as-a-service for building AI-driven applications.
A Python bot that automates increasing TikTok video views for content creators.
Open-source end-to-end vision-language-action model for GUI agents and computer usage analysis.
Get weekly updates on trending AI coding tools and projects.