Skip to content

amrzv/awesome-colab-notebooks

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

66 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Hits awesome-colab-notebooks

Word Cloud

The page might not be rendered properly. Please open README.md file directly

Awesome colab notebooks collection for ML experiments

Trending

repositories papers packages
  • agent-starter-pack
  • Qwen2.5-Omni
  • verl
  • opik
  • courses
  • crawl4ai
  • TabPFN
  • langgraph
  • litellm
  • DL_intro
  • langfun
  • SAELens
  • ARENA_3.0
  • TransformerLens
  • InvSR
  • tapnet
  • unsloth
  • trl
  • YuE
  • STAR
  • Show-1
  • MARS5-TTS
  • lm-evaluation-harness
  • cleanrl
  • DiffSynth-Studio
  • ComfyUI
  • rlds
  • llama-recipes
  • mujoco
  • PGMax
  • dinov2
  • instructor
  • autogen
  • ultralytics
  • segment-anything-2
  • xland-minigrid
  • ModernBERT
  • rl-baselines3-zoo
  • myosuite
  • GroundingDINO
  • ollama
  • dsp-theory
  • Retrieval-based-Voice-Conversion-WebUI
  • NeMo
  • evidently
  • xla
  • Qwen
  • stable-baselines3
  • torchgeo
  • datachain
    • opik
    • presidio-structured
    • Crawl4AI
    • presidio-anonymizer
    • presidio-analyzer
    • loralib
    • langchain
    • reformer-pytorch
    • xmanager
    • giskard
    • litellm
    • agent-starter-pack
    • imagededup
    • bert-score
    • unsloth
    • langgraph
    • ollama
    • google-generativeai
    • mistral-inference
    • sglang
    • folium
    • transformer-lens
    • google-cloud-aiplatform
    • mmsegmentation
    • llama-index
    • supervision
    • autofaiss
    • mmdet
    • mmpose
    • panda-gym
    • neural-tangents
    • sae-lens
    • rl-games
    • dopamine-rl
    • gin-config
    • google-cloud-discoveryengine
    • pybullet
    • tensorflow-datasets
    • tfx
    • tensorflow-federated
    • tensorflow-data-validation
    • earthengine-api
    • catboost
    • dm-reverb
    • tensorflow-gnn
    • guidance
    • scikit-video
    • tensor2tensor
    • rudolph
    • img2dataset

    Courses

    COURSES
    name description authors links colaboratory update
    LLM Engineering Essentials course 12-week course, created by experts from academia and industry, is designed specifically for developers and engineers Open In Colab 08.05.2025
    Deep Learning School course (ML + CV) Nina Konovalova Open In Colab 14.02.2025
    Introduction to Deep Learning course Tatiana Gaintseva
    • medium, medium
    • tf
    • wiki
    • yt, yt, yt, yt
    Open In Colab 24.01.2025
    ARENA Provide talented individuals with the skills, tools, and environment necessary for upskilling in ML engineering, for the purpose of contributing directly to AI alignment in technical roles Callum McDougall Open In Colab 30.12.2024
    The Autodiff Cookbook You'll go through a whole bunch of neat autodiff ideas that you can cherry pick for your own work, starting with the basics Open In Colab 20.09.2024
    Machine Learning Simplified A Gentle Introduction to Supervised Learning Andrew Wolf Open In Colab 29.08.2024
    Anthropic courses Anthropic's educational courses Anthropic
    • docs
    • reddit
    Open In Colab 22.08.2024
    mlcourse.ai Open Machine Learning Course Yury Kashnitsky Open In Colab 19.08.2024
    Deep RL Course The Hugging Face Deep Reinforcement Learning Course Open In Colab 24.06.2024
    Generative AI for Beginners - A Course A 12 Lesson course teaching everything you need to know to start building Generative AI applications microsoft Open In Colab 22.02.2024
    DSP theory Theory of digital signal processing: signals, filtration (IIR, FIR, CIC, MAF), transforms (FFT, DFT, Hilbert, Z-transform) etc Open In Colab 18.10.2022
    Machine learning course This course is broad and shallow, but author will provide additional links so that you can deepen your understanding of the ML method you need Тимчишин Віталій Open In Colab 02.09.2021
    NYU-DLSP20 This course concerns the latest techniques in deep learning and representation learning, focusing on supervised and unsupervised deep learning, embedding methods, metric learning, convolutional and recurrent nets, with applications to computer vision, natural language understanding, and speech recognition Open In Colab 30.10.2019

    Research

    RESEARCH
    name description authors links colaboratory update
    Qwen2.5-Omni End-to-end multimodal model designed to perceive diverse modalities, including text, images, audio, and video, while simultaneously generating text and natural speech responses in a streaming manner Open In Colab 29.04.2025
    SigLIP 2 Family of new multilingual vision-language encoders that build on the success of the original SigLIP
    • arxiv, arxiv
    • git, git, git
    • hf
    • medium, medium
    • yt
    Open In Colab 17.03.2025
    STAR Spatial Temporal Augmentation with T2V models for Real-world video super-resolution, a novel approach that leverages T2V models for real-world video super-resolution, achieving realistic spatial details and robust temporal consistency Open In Colab 22.01.2025
    InvSR Image super-resolution technique based on diffusion inversion, aiming at harnessing the rich image priors encapsulated in large pre-trained diffusion models to improve SR performance
    • arxiv
    • git
    • hf, hf
    • yt
    Open In Colab 21.01.2025
    ModernBERT Bringing modern model optimizations to encoder-only models and representing a major Pareto improvement over older encoders
    • arxiv
    • git, git
    • hf
    • medium
    • yt, yt, yt, yt
    Open In Colab 22.12.2024
    TAPIR Tracking Any Point with per-frame Initialization and temporal Refinement Open In Colab 30.11.2024
    PuLID Pure and Lightning ID customization, a tuning-free ID customization method for text-to-image generation
    • arxiv
    • git, git, git
    • reddit
    Open In Colab 09.11.2024
    CoTracker Architecture that jointly tracks multiple points throughout an entire video Open In Colab 16.10.2024
    Segment Anything 2 Foundation model towards solving promptable visual segmentation in images and videos Open In Colab 01.10.2024
    Deep Painterly Harmonization Algorithm produces significantly better results than photo compositing or global stylization techniques and that it enables creative painterly edits that would be otherwise difficult to achieve
    • arxiv, arxiv
    • git, git, git
    Open In Colab 23.09.2024
    audio2photoreal Framework for generating full-bodied photorealistic avatars that gesture according to the conversational dynamics of a dyadic interaction Open In Colab 13.09.2024
    Fast Segment Anything CNN Segment Anything Model trained using only 2% of the SA-1B dataset published by SAM authors
    • arxiv, arxiv
    • git
    • medium
    • yt, yt, yt
    Open In Colab 10.09.2024
    Neuralangelo Framework for high-fidelity 3D surface reconstruction from RGB video captures Open In Colab 02.09.2024
    YOLOv10 Aim to further advance the performance-efficiency boundary of YOLOs from both the post-processing and model architecture Open In Colab 20.08.2024
    SpecVQGAN Taming the visually guided sound generation by shrinking a training dataset to a set of representative vectors Open In Colab 12.07.2024
    LivePortrait Video-driven portrait animation framework with a focus on better generalization, controllability, and efficiency for practical usage Open In Colab 10.07.2024
    StoryDiffusion Way of self-attention calculation, termed Consistent Self-Attention, that significantly boosts the consistency between the generated images and augments prevalent pretrained diffusion-based text-to-image models in a zero-shot manner Open In Colab 04.05.2024
    VoiceCraft token infilling neural codec language model, that achieves state-of-the-art performance on both speech editing and zero-shot text-to-speech on audiobooks, internet videos, and podcasts Open In Colab 21.04.2024
    ZeST Method for zero-shot material transfer to an object in the input image given a material exemplar image Open In Colab 16.04.2024
    InstantMesh Feed-forward framework for instant 3D mesh generation from a single image, featuring state-of-the-art generation quality and significant training scalability
    • arxiv
    • git, git, git
    • hf
    • reddit
    • yt
    Open In Colab 16.04.2024
    Würstchen Architecture for text-to-image synthesis that combines competitive performance with unprecedented cost-effectiveness for large-scale text-to-image diffusion models
    • arxiv
    • hf
    • reddit
    • yt
    Open In Colab 06.04.2024
    AudioSep Foundation model for open-domain audio source separation with natural language queries Open In Colab 15.03.2024
    AQLM Extreme Compression of Large Language Models via Additive Quantization
    • arxiv
    • hf, hf, hf
    • reddit
    • yt, yt
    Open In Colab 08.03.2024
    YOLOv9 Learning What You Want to Learn Using Programmable Gradient Information Open In Colab 05.03.2024
    Multi-LoRA Composition LoRA Switch and LoRA Composite, approaches that aim to surpass traditional techniques in terms of accuracy and image quality, especially in complex compositions Open In Colab 03.03.2024
    AMARETTO Multiscale and multimodal inference of regulatory networks to identify cell circuits and their drivers shared and distinct within and across biological systems of human disease Open In Colab 28.02.2024
    ViT Vision Transformer and MLP-Mixer Architectures Open In Colab 06.02.2024
    Qwen Comprehensive language model series that encompasses distinct models with varying parameter counts qwenlm
    • arxiv, arxiv, arxiv, arxiv
    • discord
    • docker
    • git, git, git, git, git, git
    • hf
    • pt
    • yt, yt, yt
    Open In Colab 30.01.2024
    VALL-E X Cross-lingual neural codec language model for cross-lingual speech synthesis Open In Colab 19.01.2024
    PhotoMaker Efficient personalized text-to-image generation method, which mainly encodes an arbitrary number of input ID images into a stack ID embedding for preserving ID information Open In Colab 18.01.2024
    DDColor End-to-end method with dual decoders for image colorization
    • arxiv
    • git, git
    Open In Colab 15.01.2024
    PASD Pixel-aware stable diffusion network to achieve robust Real-ISR as well as personalized stylization
    • arxiv
    • git
    • hf, hf
    • reddit
    Open In Colab 12.01.2024
    HandRefiner Refining Malformed Hands in Generated Images by Diffusion-based Conditional Inpainting
    • arxiv
    • git, git, git
    • reddit
    • yt
    Open In Colab 08.01.2024
    LLaVA Large Language and Vision Assistant, an end-to-end trained large multimodal model that connects a vision encoder and LLM for general-purpose visual and language understanding Open In Colab 22.12.2023
    SMPLer-X Scaling up EHPS towards the first generalist foundation model, with up to ViT-Huge as the backbone and training with up to 4.5M instances from diverse data sources Open In Colab 18.12.2023
    DeepCache Training-free paradigm that accelerates diffusion models from the perspective of model architecture Open In Colab 18.12.2023
    MagicAnimate Diffusion-based framework that aims at enhancing temporal consistency, preserving reference image faithfully, and improving animation fidelity Open In Colab 18.12.2023
    DiffBIR Towards Blind Image Restoration with Generative Diffusion Prior Open In Colab 18.12.2023
    Segment and Track Anything Framewoork that allows users to precisely and effectively segment and track any object in a video
    • arxiv, arxiv, arxiv
    • git
    • hf
    • neurips, neurips
    • yt, yt, yt, yt, yt, yt, yt, yt, yt, yt, yt, yt
    Open In Colab 08.12.2023
    AudioLDM Text-to-audio system that is built on a latent space to learn the continuous audio representations from contrastive language-audio pretraining latents Open In Colab 02.12.2023
    TabPFN Neural network that learned to do tabular data prediction Open In Colab 29.11.2023
    Concept Sliders Plug-and-play low rank adaptors applied on top of pretrained models Open In Colab 26.11.2023
    Qwen-VL Set of large-scale vision-language models designed to perceive and understand both text and images Open In Colab 24.11.2023
    PixArt-Σ Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation Open In Colab 07.11.2023
    Zero123++ Image-conditioned diffusion model for generating 3D-consistent multi-view images from a single input view
    • arxiv
    • git, git
    • hf, hf
    • medium
    • reddit
    • yt
    Open In Colab 26.10.2023
    Show-1 Hybrid model, dubbed as Show-1, which marries pixel-based and latent-based VDMs for text-to-video generation Open In Colab 15.10.2023
    DA-CLIP Degradation-aware vision-language model to better transfer pretrained vision-language models to low-level vision tasks as a universal framework for image restoration Open In Colab 11.10.2023
    Musika Music generation system that can be trained on hundreds of hours of music using a single consumer GPU, and that allows for much faster than real-time generation of music of arbitrary length on a consumer CPU Open In Colab 09.10.2023
    YOLOv6 Single-stage object detection framework dedicated to industrial applications Open In Colab 08.10.2023
    DreamGaussian Algorithm to convert 3D Gaussians into textured meshes and apply a fine-tuning stage to refine the details Open In Colab 04.10.2023
    DINOv2 Produce high-performance visual features that can be directly employed with classifiers as simple as linear layers on a variety of computer vision tasks; these visual features are robust and perform well across domains without any requirement for fine-tuning Open In Colab 31.08.2023
    StyleGAN 3 Alias-Free Generative Adversarial Networks Open In Colab 13.08.2023
    Big GAN Large Scale GAN Training for High Fidelity Natural Image Synthesis
    • arxiv
    Open In Colab 03.08.2023
    CutLER Simple approach for training unsupervised object detection and segmentation models Open In Colab 24.07.2023
    Recognize Anything & Tag2Text Vision language pre-training framework, which introduces image tagging into vision-language models to guide the learning of visual-linguistic features Open In Colab 09.07.2023
    MobileSAM Towards Lightweight SAM for Mobile Applications
    • arxiv
    • git, git, git, git, git, git, git, git
    • twitter
    • yt
    Open In Colab 30.06.2023
    Grounding DINO Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
    • arxiv
    • git, git, git, git, git, git, git
    • pwc, pwc, pwc, pwc
    • yt, yt, yt, yt
    Open In Colab 28.06.2023
    T5X Modular, composable, research-friendly framework for high-performance, configurable, self-service training, evaluation, and inference of sequence models at many scales
    • arxiv, arxiv
    • docs
    • git, git
    • tf, tf, tf
    Open In Colab 27.06.2023
    Gen-L-Video Extending off-the-shelf short video diffusion models for generating and editing videos comprising hundreds of frames with diverse semantic segments without introducing additional training, all while preserving content consistency Open In Colab 04.06.2023
    First Order Motion Model for Image Animation Transferring facial movements from video to image Aliaksandr Siarohin Open In Colab 04.06.2023
    MMS The Massively Multilingual Speech project expands speech technology from about 100 languages to over 1000 by building a single multilingual speech recognition model supporting over 1100 languages, language identification models able to identify over 4000 languages, pretrained models supporting over 1400 languages, and text-to-speech models for over 1100 languages
    • arxiv
    • hf, hf, hf
    • meta
    • yt, yt
    Open In Colab 26.05.2023
    FAB Flow AIS Bootstrap uses AIS to generate samples in regions where the flow is a poor approximation of the target, facilitating the discovery of new modes
    • arxiv
    • git, git
    • yt
    Open In Colab 29.04.2023
    CodeFormer Transformer-based prediction network to model global composition and context of the low-quality faces for code prediction, enabling the discovery of natural faces that closely approximate the target faces even when the inputs are severely degraded Open In Colab 21.04.2023
    Segment Anything The Segment Anything Model produces high quality object masks from input prompts such as points or boxes, and it can be used to generate masks for all objects in an image Open In Colab 10.04.2023
    EVA3D High-quality unconditional 3D human generative model that only requires 2D image collections for training Open In Colab 06.04.2023
    Stable Dreamfusion Using a pretrained 2D text-to-image diffusion model to perform text-to-3D synthesis Open In Colab 04.04.2023
    Visual ChatGPT Connects ChatGPT and a series of Visual Foundation Models to enable sending and receiving images during chatting
    • arxiv
    • git, git, git, git
    • yt, yt
    Open In Colab 15.03.2023
    GPEN GAN Prior Embedded Network for Blind Face Restoration in the Wild Open In Colab 15.02.2023
    Disco Diffusion A frankensteinian amalgamation of notebooks, models and techniques for the generation of AI Art and Animations
    • git
    • yt, yt, yt
    Open In Colab 11.02.2023
    GrooVAE Some applications of machine learning for generating and manipulating beats and drum performances Open In Colab 02.02.2023
    Multitrack MusicVAE The models in this notebook are capable of encoding and decoding single measures of up to 8 tracks, optionally conditioned on an underlying chord Open In Colab 02.02.2023
    MusicVAE A Hierarchical Latent Vector Model for Learning Long-Term Structure in Music Open In Colab 02.02.2023
    LORA Low-Rank Adaptation, which freezes the pre-trained model weights and injects trainable rank decomposition matrices into each layer of the Transformer architecture, greatly reducing the number of trainable parameters for downstream tasks
    • arxiv, arxiv, arxiv, arxiv, arxiv
    • git
    • hf, hf
    • medium, medium
    • pypi
    • reddit, reddit
    • yt, yt, yt, yt, yt, yt
    Open In Colab 30.01.2023
    Fourier Feature Networks Fourier Features Let Networks Learn High Frequency Functions in Low Dimensional Domains Open In Colab 17.01.2023
    Demucs Hybrid Spectrogram and Waveform Source Separation Alexandre Défossez
    • arxiv, arxiv, arxiv, arxiv
    • git, git, git, git
    Open In Colab 21.11.2022
    MotionDiffuse The first diffusion model-based text-driven motion generation framework, which demonstrates several desired properties over existing methods Open In Colab 13.10.2022
    PyMAF Pyramidal Mesh Alignment Feedback loop in regression network for well-aligned body mesh recovery and extend it for the recovery of expressive full-body models Open In Colab 06.10.2022
    Functa From data to functa: Your data point is a function and you can treat it like one
    • arxiv
    • git, git
    • tf
    Open In Colab 24.09.2022
    Whisper Automatic speech recognition system trained on 680,000 hours of multilingual and multitask supervised data collected from the web Open In Colab 21.09.2022
    DeOldify (video) Colorize your own videos! Jason Antic Open In Colab 19.09.2022
    DeOldify (photo) Colorize your own photos! Open In Colab 19.09.2022
    Decision Transformers An architecture that casts the problem of RL as conditional sequence modeling Open In Colab 06.09.2022
    textual-inversion An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion Open In Colab 21.08.2022
    Make-A-Scene Scene-Based Text-to-Image Generation with Human Priors
    • arxiv
    • yt
    Open In Colab 12.08.2022
    YOLOv7 Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors Open In Colab 09.08.2022
    OPT Open Pre-trained Transformers is a family of NLP models trained on billions of tokens of text obtained from the internet Open In Colab 29.06.2022
    Customizing a Transformer Encoder We will learn how to customize the encoder to employ new network architectures Chen Chen
    • arxiv
    • git
    Open In Colab 22.06.2022
    SimCTG Contrastive training objective to calibrate the model's representation space, and a decoding method -- contrastive search -- to encourage diversity while maintaining coherence in the generated text
    • arxiv, arxiv
    • git, git
    • hf, hf, hf
    • neurips
    • pypi
    Open In Colab 04.06.2022
    T0 Multitask Prompted Training Enables Zero-Shot Task Generalization
    • arxiv
    • yt, yt
    Open In Colab 29.05.2022
    Text2Mesh Text-Driven Neural Stylization for Meshes Open In Colab 14.05.2022
    T5 Text-To-Text Transfer Transformer
    • arxiv
    • git
    • tf
    Open In Colab 11.05.2022
    XLS-R Self-supervised Cross-lingual Speech Representation Learning at Scale Open In Colab 10.05.2022
    MAGIC Training-free framework, iMAge-Guided text generatIon with CLIP, for plugging in visual controls in the generation process and enabling LMs to perform multimodal tasks in a zero-shot manner
    • arxiv
    Open In Colab 02.05.2022
    DiffCSE Unsupervised contrastive learning framework for learning sentence embeddings
    • arxiv, arxiv, arxiv
    • git
    • hf
    • twitter
    Open In Colab 24.04.2022
    ViDT+ An Extendable, Efficient and Effective Transformer-based Object Detector
    • arxiv, arxiv
    • git, git
    Open In Colab 20.04.2022
    GP-UNIT Novel framework, Generative Prior-guided UNsupervised Image-to-image Translation, to improve the overall quality and applicability of the translation algorithm Open In Colab 02.04.2022
    CLIPasso Semantically-Aware Object Sketching Open In Colab 21.03.2022
    Disentangled Lifespan Face Synthesis LFS model is proposed to disentangle the key face characteristics including shape, texture and identity so that the unique shape and texture age transformations can be modeled effectively Open In Colab 22.02.2022
    ClipCap CLIP Prefix for Image Captioning Open In Colab 15.02.2022
    Pose with Style Detail-Preserving Pose-Guided Image Synthesis with Conditional StyleGAN Open In Colab 19.01.2022
    diffsort Differentiable Sorting Networks
    • arxiv, arxiv
    • yt
    Open In Colab 17.01.2022
    GLIDE Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models
    • arxiv
    • yt
    Open In Colab 22.12.2021
    SOAT StyleGAN of All Trades: Image Manipulation with Only Pretrained StyleGAN
    • arxiv
    • git, git
    • hf
    Open In Colab 13.11.2021
    Arnheim Generative Art Using Neural Visual Grammars and Dual Encoders
    • arxiv, arxiv, arxiv, arxiv, arxiv
    • git
    • wiki
    • yt, yt, yt, yt
    Open In Colab 11.11.2021
    GPT-2 Retrain an advanced text generating neural network on any text dataset using gpt-2-simple! Max Woolf Open In Colab 18.10.2021
    ConvMixer An extremely simple model that is similar in spirit to the ViT and the even-more-basic MLP-Mixer in that it operates directly on patches as input, separates the mixing of spatial and channel dimensions, and maintains equal size and resolution throughout the network
    • arxiv
    • git, git
    • medium
    • yt
    Open In Colab 06.10.2021
    IC-GAN Instance-Conditioned GAN Open In Colab 01.10.2021
    VITS Parallel end-to-end TTS method that generates more natural sounding audio than current two-stage models Open In Colab 23.08.2021
    CogView Mastering Text-to-Image Generation via Transformers Open In Colab 21.06.2021
    GANs N' Roses Stable, Controllable, Diverse Image to Image Translation
    • arxiv, arxiv
    • git, git
    • yt
    Open In Colab 19.06.2021
    Score SDE Score-Based Generative Modeling through Stochastic Differential Equations
    • arxiv, arxiv, arxiv, arxiv
    • git, git
    • yt
    Open In Colab 18.03.2021
    Talking Head Anime from a Single Image The network takes as input an image of an anime character's face and a desired pose, and it outputs another image of the same character in the given pose Pramook Khungurn Open In Colab 23.02.2021
    NFNet An adaptive gradient clipping technique, a significantly improved class of Normalizer-Free ResNets
    • arxiv, arxiv
    • git
    • yt, yt
    Open In Colab 17.02.2021
    CLIP A neural network which efficiently learns visual concepts from natural language supervision Open In Colab 29.01.2021
    Adversarial Patch A method to create universal, robust, targeted adversarial image patches in the real world Tom Brown
    • arxiv
    Open In Colab 27.01.2021
    MusicXML Documentation The goal of this notebook is to explore one of the magenta libraries for music Open In Colab 08.01.2021
    Neural Magic Eye Learning to See and Understand the Scene Behind an Autostereogram Open In Colab 01.01.2021
    SIREN Implicit Neural Representations with Periodic Activation Functions Open In Colab 25.06.2020
    Onsets and Frames Onsets and Frames is an automatic music transcription framework with piano and drums models Open In Colab 02.04.2020
    FBA Matting Low-cost modification to alpha matting networks to also predict the foreground and background colours
    • arxiv
    • git
    • hf
    • pwc
    Open In Colab 19.03.2020
    BERT score An automatic evaluation metric for text generation Tianyi Zhang
    • arxiv
    • pypi
    Open In Colab 05.03.2020
    ProxylessNAS Directly learn the architectures for large-scale target tasks and target hardware platforms
    • arxiv, arxiv, arxiv
    • medium
    • pt
    • reddit
    • yt, yt
    Open In Colab 29.10.2019
    Generating Piano Music with Transformer This Colab notebook lets you play with pretrained Transformer models for piano music generation, based on the Music Transformer Open In Colab 16.09.2019
    GANSynth This notebook is a demo GANSynth, which generates audio with Generative Adversarial Networks Jesse Engel Open In Colab 25.02.2019
    Latent Constraints Conditional Generation from Unconditional Generative Models Open In Colab 27.11.2017
    Performance RNN This notebook shows you how to generate new performed compositions from a trained model Open In Colab 11.07.2017
    NSynth This colab notebook has everything you need to upload your own sounds and use NSynth models to reconstruct and interpolate between them Open In Colab 06.04.2017

    Tutorials

    TUTORIALS
    name description authors links colaboratory update
    YOLOv8 State-of-the-art model that builds upon the success of previous YOLO versions and introduces new features and improvements to further boost performance and flexibility Glenn Jocher Open In Colab 01.05.2025
    dm_control DeepMind Infrastructure for Physics-Based Simulation Open In Colab 30.04.2025
    MuJoCo A general purpose physics engine that aims to facilitate research and development in robotics, biomechanics, graphics and animation, machine learning, and other areas which demand fast and accurate simulation of articulated structures interacting with their environment Open In Colab 30.04.2025
    Optimum Extension of Transformers and Diffusers, providing a set of optimization tools enabling maximum efficiency to train and run models on targeted hardware, while keeping things easy to use Hugging Face
    • git
    • hf, hf
    • yt, yt, yt
    Open In Colab 29.04.2025
    Ray Unified framework for scaling AI and Python applications Open In Colab 29.04.2025
    verl HybridFlow, which combines single-controller and multi-controller paradigms in a hybrid manner to enable flexible representation and efficient execution of the RLHF dataflow
    • arxiv, arxiv, arxiv, arxiv
    • docs
    • git, git, git, git, git, git, git, git
    • slack
    • twitter
    Open In Colab 29.04.2025
    LangGraph Library for building stateful, multi-actor applications with LLMs, used to create agent and multi-agent workflows William FH Open In Colab 25.04.2025
    Brax A differentiable physics engine that simulates environments made up of rigid bodies, joints, and actuators
    • arxiv
    • neurips
    Open In Colab 22.04.2025
    Opik From RAG chatbots to code assistants to complex agentic pipelines and beyond, build LLM systems that run better, faster, and cheaper with tracing, evaluations, and dashboards Jacques Verré Open In Colab 21.04.2025
    YOLOv3 You Only Look Once Glenn Jocher Open In Colab 18.04.2025
    YOLOv5 You Only Look Once Glenn Jocher Open In Colab 18.04.2025
    Giskard Open-source library to detect hallucinations and security issues to turn them into test suites that you can automatically execute
    • discord
    • docs
    • medium
    • pypi
    • yt, yt, yt, yt
    Open In Colab 16.04.2025
    AutoGen Framework that enables development of LLM applications using multiple agents that can converse with each other to solve tasks microsoft Open In Colab 15.04.2025
    Evidently An open-source framework to evaluate, test and monitor ML models in production Open In Colab 08.04.2025
    Llama 4 Open-weight natively multimodal models with unprecedented context length support and our first built using a MoE architecture meta
    • docs
    • git, git
    • hf
    • medium
    • meta
    • reddit
    • yt, yt, yt, yt, yt, yt
    Open In Colab 07.04.2025
    Agent Starter Pack Collection of production-ready Generative AI Agent templates built for Google Cloud Kristopher Overholt
    • medium, medium
    • pypi
    • reddit
    • yt, yt, yt, yt
    Open In Colab 31.03.2025
    Vertex AI Search brings together the power of deep information retrieval, state-of-the-art natural language processing, and the latest in LLM processing to understand user intent and return the most relevant results for the user Megha Agarwal Open In Colab 31.03.2025
    xFormers Toolbox to Accelerate Research on Transformers
    • docs
    • git, git, git, git, git, git, git, git
    • yt
    Open In Colab 25.03.2025
    SkyThought Train your own O1 preview model within $450 Sumanth Hegde Open In Colab 20.03.2025
    Sentence Transformers Multilingual Sentence, Paragraph, and Image Embeddings using BERT & Co
    • arxiv, arxiv, arxiv
    • docs
    Open In Colab 07.03.2025
    Crawl4AI LLM Friendly Web Crawler & Scrapper UncleCode
    • docs
    • medium
    • pypi
    • twitter
    • yt, yt, yt, yt
    Open In Colab 28.02.2025
    LangChain Framework for developing applications powered by large language models Bagatur
    • docs
    • git, git, git, git
    • medium, medium, medium
    • pypi
    • twitter
    • wiki
    • yt, yt, yt, yt, yt, yt, yt, yt, yt
    Open In Colab 25.02.2025
    LM Evaluation Harness Framework for few-shot evaluation of language models. Lintang Sutawika Open In Colab 21.02.2025
    Multimodal Maestro Gives you more control over large multimodal models to get the outputs you want Piotr Skalski Open In Colab 18.02.2025
    YuE Groundbreaking series of open-source foundation models designed for music generation, specifically for transforming lyrics into full songs Mozer Open In Colab 16.02.2025
    Datasets A Community Library for Natural Language Processing
    • arxiv
    • docs
    • hf
    • kaggle
    • yt
    Open In Colab 13.02.2025
    moondream Tiny vision language model that kicks ass and runs anywhere Vik Korrapati Open In Colab 06.02.2025
    VC Client software for performing real-time voice conversion using various Voice Conversion AI w-okada
    • git
    • hf
    • yt, yt, yt, yt, yt, yt, yt, yt, yt
    Open In Colab 30.01.2025
    Building Your Own Federated Learning Algorithm We discuss how to implement federated learning algorithms without deferring to the tff.learning API Zachary Charles Open In Colab 29.01.2025
    Federated Learning for Image Classification We use the classic MNIST training example to introduce the Federated Learning API layer of TFF, tff.learning - a set of higher-level interfaces that can be used to perform common types of federated learning tasks, such as federated training, against user-supplied models implemented in TensorFlow Krzysztof Ostrowski Open In Colab 29.01.2025
    Federated Learning for Text Generation We start with a RNN that generates ASCII characters, and refine it via federated learning Krzysztof Ostrowski Open In Colab 29.01.2025
    Custom Federated Algorithms, Part 1: Introduction to the Federated Core This tutorial is the first part of a two-part series that demonstrates how to implement custom types of federated algorithms in TensorFlow Federated using the Federated Core - a set of lower-level interfaces that serve as a foundation upon which we have implemented the Federated Learning layer Krzysztof Ostrowski
    • arxiv
    • pwc
    • pypi
    • tf, tf
    Open In Colab 29.01.2025
    Custom Federated Algorithms, Part 2: Implementing Federated Averaging This tutorial is the second part of a two-part series that demonstrates how to implement custom types of federated algorithms in TFF using the Federated Core, which serves as a foundation for the Federated Learning layer Krzysztof Ostrowski
    • pwc
    • pypi
    • tf, tf
    Open In Colab 29.01.2025
    CodeGemma How to load, fine-tune and deploy CodeGemma model on SQL by utilising Hugging Face Carlo Fisicaro
    • arxiv
    • docs
    • hf
    • kaggle
    • reddit
    • yt
    Open In Colab 22.01.2025
    Invariant Agent Stack Framework-less approach that currently consists of three key projects, each of which can be used independently or in combination to build, test, and secure AI agents Fei Xie Open In Colab 20.01.2025
    Ollama Get up and running with large language models Michael Yang Open In Colab 13.01.2025
    NotebookLlama Open Source version of NotebookLM Sanyam Bhutani Open In Colab 09.01.2025
    Anomalib Deep learning library that aims to collect state-of-the-art anomaly detection algorithms for benchmarking on both public and private datasets Open In Colab 08.01.2025
    Hello, many worlds This tutorial shows how a classical neural network can learn to correct qubit calibration errors Michael Broughton
    • tf, tf, tf
    • wiki
    • yt
    Open In Colab 04.01.2025
    LightAutoML Allows you create machine learning models using just a few lines of code, or build your own custom pipeline using ready blocks Open In Colab 22.12.2024
    TorchGeo PyTorch domain library that provides datasets, transforms, samplers, and pre-trained models specific to geospatial data Open In Colab 19.12.2024
    Langfun PyGlove powered library that aims to make language models fun to work with Daiyi Peng
    • discord
    • git
    • pypi
    Open In Colab 16.12.2024
    ComfyUI Powerful and modular stable diffusion GUI and backend comfyanonymous Open In Colab 13.12.2024
    TransformerLens Library for doing mechanistic interpretability of GPT-2 Style language models
    • arxiv, arxiv
    • docs
    • git
    • medium
    • pypi
    • slack
    • yt, yt
    Open In Colab 09.12.2024
    Supervision Reusable computer vision tools Piotr Skalski Open In Colab 05.12.2024
    XManager Framework for managing machine learning experiment Andrew Chen Open In Colab 05.12.2024
    Flax Neural network library and ecosystem for JAX designed for flexibility
    • docs
    • hf
    • medium
    • reddit
    • yt, yt, yt
    Open In Colab 05.12.2024
    Haiku A library built on top of JAX designed to provide simple, composable abstractions for machine learning research Open In Colab 05.12.2024
    SGLang Fast serving framework for large language models and vision language models Open In Colab 05.12.2024
    SAE Lens Training Sparse Autoencoders on Language Models
    • docs
    • pypi
    • slack
    Open In Colab 03.12.2024
    Feast An open source feature store for machine learning Open In Colab 22.11.2024
    FiftyOne Open-source tool for building high-quality datasets and computer vision models Open In Colab 21.11.2024
    CatBoost High-performance open source library for gradient boosting on decision trees Open In Colab 18.11.2024
    Llama 3.1 First openly available model that rivals the top AI models when it comes to state-of-the-art capabilities in general knowledge, steerability, math, tool use, and multilingual translation unsloth Open In Colab 17.11.2024
    Mistral Small Enterprise-grade small model unsloth Open In Colab 17.11.2024
    DPO Zephyr Starting from a dataset of outputs ranked by a teacher model, we apply distilled direct preference optimization to learn a chat model with significantly improved intent alignment
    • arxiv
    • discord
    • git
    • hf, hf
    • medium, medium
    • pypi
    • reddit
    • twitter
    • yt, yt, yt, yt
    Open In Colab 17.11.2024
    ORPO Get up and running with large language models
    • arxiv
    • discord
    • git
    • hf, hf
    • medium, medium
    • pypi
    • reddit
    • yt, yt, yt
    Open In Colab 17.11.2024
    Phi-3.5 3.8 billion parameter language model trained on 3.3 trillion tokens, whose overall performance, as measured by both academic benchmarks and internal testing, rivals that of models such as Mixtral 8x7B and GPT-3.5, despite being small enough to be deployed on a phone unsloth Open In Colab 17.11.2024
    Simple audio recognition This tutorial will show you how to build a basic speech recognition network that recognizes ten different words Google Open In Colab 15.11.2024
    High-performance simulations with TFF This tutorial will describe how to setup high-performance simulations with TFF in a variety of common scenarios Krzysztof Ostrowski
    • pwc
    • pypi
    • tf
    Open In Colab 01.11.2024
    Autodistill Uses big, slower foundation models to train small, faster supervised models autodistill
    • blog post
    • docs
    • git, git, git, git, git, git, git, git, git, git, git, git, git, git, git, git, git
    • yt, yt, yt
    Open In Colab 01.11.2024
    Swarm Educational framework exploring ergonomic, lightweight multi-agent orchestration
    • medium, medium
    • reddit
    • yt, yt, yt, yt
    Open In Colab 15.10.2024
    TRL Set of tools to train transformer language models with Reinforcement Learning, from the Supervised Fine-tuning step, Reward Modeling step to the Proximal Policy Optimization step
    • arxiv
    • docs
    • git
    • yt, yt
    Open In Colab 24.09.2024
    TFX End-to-end platform for deploying production ML pipelines Open In Colab 18.09.2024
    TFDV TensorFlow Data Validation is a library for exploring and validating machine learning data Open In Colab 18.09.2024
    PEFT Parameter-Efficient Fine-Tuning methods enable efficient adaptation of pre-trained language models to various downstream applications without fine-tuning all the model's parameters Open In Colab 13.09.2024
    SAA+ Framework, Segment Any Anomaly +, for zero-shot anomaly segmentation with hybrid prompt regularization to improve the adaptability of modern foundation models
    • arxiv
    • git, git
    • hf
    Open In Colab 13.09.2024
    TensorRT SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for inference applications nvidia Open In Colab 12.09.2024
    guidance Enables you to control modern language models more effectively and efficiently than traditional prompting or chaining Scott Lundberg Open In Colab 09.09.2024
    Arena-Hard-Auto Automated pipeline that leverages LLMs to curate high-quality, open-ended prompts from large, crowd-sourced datasets, enabling continuous benchmark updates without human in the loop Open In Colab 09.09.2024
    DataChain AI-dataframe to enrich, transform and analyze data from cloud storages for ML training and LLM apps Daniel K
    • discord
    • docs
    • pypi
    • twitter
    • yt, yt
    Open In Colab 09.09.2024
    TFF for Federated Learning Research: Model and Update Compression We use the EMNIST dataset to demonstrate how to enable lossy compression algorithms to reduce communication cost in the Federated Averaging algorithm Weikang Song Open In Colab 05.09.2024
    LlamaIndex Data framework for your LLM application Jerry Liu Open In Colab 05.09.2024
    Deforum Stable Diffusion Open source project is designed to be free to use and easy to modify for custom needs and pipelines Open In Colab 30.08.2024
    Nerfstudio API that allows for a simplified end-to-end process of creating, training, and testing NeRFs Open In Colab 19.08.2024
    highway-env A collection of environments for autonomous driving and tactical decision-making tasks Edouard Leurent
    • arxiv, arxiv, arxiv
    • docs
    • git, git, git
    Open In Colab 09.08.2024
    GNN Production-tested library for building GNNs at large scale
    • arxiv
    • kaggle
    • medium
    • pypi
    • tf, tf
    • yt, yt, yt, yt, yt, yt
    Open In Colab 09.08.2024
    Image classification This tutorial shows how to classify images of flowers Billy Lamberta
    • pwc
    • tf, tf, tf
    Open In Colab 24.07.2024
    Kor Half-baked prototype that "helps" you extract structured data from text using LLMs Eugene Yurtsev
    • discord
    • docs
    Open In Colab 20.07.2024
    Mistral Inference Minimal code to run Mistral models mistral Open In Colab 16.07.2024
    XLand-MiniGrid Suite of tools and grid-world environments for meta-reinforcement learning research
    • arxiv
    • deepmind, deepmind
    • git, git, git
    • hf
    • neurips
    • pypi
    • twitter
    Open In Colab 12.07.2024
    PyTorch3D Library for deep learning with 3D data Open In Colab 11.07.2024
    Stable Diffusion Videos Create videos with Stable Diffusion by exploring the latent space and morphing between text prompts Nathan Raw
    • git, git
    Open In Colab 11.07.2024
    Presidio Context aware, pluggable and customizable PII de-identification service for text and images Omri Mendels
    • docker, docker, docker
    • docs
    • hf
    • medium, medium
    • pypi, pypi, pypi, pypi
    • yt, yt, yt, yt, yt
    Open In Colab 10.07.2024
    Transfer learning and fine-tuning You will learn how to classify images of cats and dogs by using transfer learning from a pre-trained network François Chollet
    • pwc
    • tf
    • wiki
    Open In Colab 26.06.2024
    MARS5 Speech model for insane prosody Matthew Baas Open In Colab 25.06.2024
    ToonCrafter Can interpolate two cartoon images by leveraging the pre-trained image-to-video diffusion priors Open In Colab 20.06.2024
    DiffSynth Restructured architectures including Text Encoder, UNet, VAE, among others, maintaining compatibility with models from the open-source community while enhancing computational performance Artiprocher
    • arxiv
    • hf, hf
    Open In Colab 06.06.2024
    Transformer This tutorial trains a Transformer model to translate Portuguese to English Open In Colab 31.05.2024
    NeMo A conversational AI toolkit built for researchers working on automatic speech recognition, natural language processing, and text-to-speech synthesis Open In Colab 25.05.2024
    SentencePiece An unsupervised text tokenizer and detokenizer mainly for Neural Network-based text generation systems where the vocabulary size is predetermined prior to the neural model training
    • arxiv, arxiv, arxiv, arxiv, arxiv
    • git, git, git, git
    • medium
    • yt
    Open In Colab 21.05.2024
    Llama3 from scratch Llama3 from scratch, one tensor and matrix multiplication at a time Nishant Aklecha
    • git
    • twitter, twitter
    • yt
    Open In Colab 19.05.2024
    IC-Light Manipulate the illumination of images
    • arxiv, arxiv
    • yt, yt, yt
    Open In Colab 09.05.2024
    Neural style transfer This tutorial uses deep learning to compose one image in the style of another image
    • arxiv
    Open In Colab 06.05.2024
    Autoencoders This tutorial introduces autoencoders with three examples: the basics, image denoising, and anomaly detection Billy Lamberta Open In Colab 15.04.2024
    MagicTime Metamorphic time-lapse video generation model, which learns real-world physics knowledge from time-lapse videos and implements metamorphic generation Open In Colab 14.04.2024
    SAGE Methodology for generative spelling correction, which was tested on English and Russian languages and potentially can be extended to any language with minor changes
    • arxiv
    • git
    • hf, hf, hf, hf, hf
    • wiki
    • yt
    Open In Colab 11.04.2024
    Open-Sora Plan Simple and efficient design along with remarkable performance in text-to-video generation YUAN Lab at PKU
    • arxiv
    • discord
    • git, git, git
    • hf, hf
    • yt, yt
    Open In Colab 07.04.2024
    AniPortrait Framework for generating high-quality animation driven by audio and a reference portrait image
    • arxiv
    • git, git, git, git, git
    • hf, hf, hf, hf, hf
    • reddit
    • yt, yt
    Open In Colab 27.03.2024
    OpenVINO Open-source toolkit for optimizing and deploying AI inference intel Open In Colab 25.03.2024
    Google Generative AI Documentation for Google's Gen AI site - including the Gemini API and Gemma Google Open In Colab 22.03.2024
    Gazelle Joint Speech Language Model Chris Hua Open In Colab 20.03.2024
    Intel® Extension for Transformers Transformer-based Toolkit to Accelerate GenAI/LLM Everywhere intel
    • arxiv, arxiv, arxiv, arxiv, arxiv
    • discord
    • docs
    • git, git, git, git, git, git
    • hf, hf, hf
    • medium, medium, medium, medium, medium
    • yt, yt, yt, yt, yt
    Open In Colab 19.03.2024
    Instructor Library that makes it a breeze to work with structured outputs from large language models Jason Liu
    • discord
    • docs
    • twitter
    • yt, yt, yt
    Open In Colab 13.03.2024
    MetaVoice 1.2B parameter base model trained on 100K hours of speech for TTS MetaVoice Open In Colab 26.02.2024
    OmegaConf Hierarchical configuration system, with support for merging configurations from multiple sources providing a consistent API regardless of how the configuration was created Omry Yadan Open In Colab 15.02.2024
    Optuna An automatic hyperparameter optimization software framework, particularly designed for machine learning Open In Colab 15.02.2024
    Data augmentation This tutorial demonstrates data augmentation: a technique to increase the diversity of your training set by applying random transformations such as image rotation Billy Lamberta
    • pwc
    • tf, tf
    • wiki
    Open In Colab 14.02.2024
    Stable Cascade Text to image model introduces an interesting three-stage approach, setting new benchmarks for quality, flexibility, fine-tuning, and efficiency with a focus on further eliminating hardware barriers Stability AI Open In Colab 14.02.2024
    CleanVision Automatically detects potential issues in image datasets like images that are: blurry, under/over-exposed, (near) duplicates, etc Sanjana Open In Colab 13.02.2024
    DynamiCrafter Animating Open-domain Images with Video Diffusion Priors Open In Colab 12.02.2024
    XLA Accelerated Linear Algebra is an open-source machine learning compiler for GPUs, CPUs, and ML accelerators George Karpenkov
    • medium, medium
    • pt
    • tf
    • wiki
    • yt, yt, yt, yt
    Open In Colab 02.02.2024
    Composer PyTorch library that enables you to train neural networks faster, at lower cost, and to higher accuracy The Mosaic ML Team Open In Colab 01.02.2024
    Transformer Engine Library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point precision on Hopper, Ada, and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference Open In Colab 19.01.2024
    Integrated gradients This tutorial demonstrates how to implement Integrated Gradients, an Explainable AI technique Open In Colab 17.01.2024
    MAGNeT Masked generative sequence modeling method that operates directly over several streams of audio tokens Open In Colab 16.01.2024
    AutoFaiss Automatically create Faiss knn indices with the most optimal similarity search parameters Romain Beaumont
    • docs
    • git
    • medium
    • pypi
    Open In Colab 12.01.2024
    Retrieval based Voice Conversion WebUI An easy-to-use Voice Conversion framework based on VITS 源文雨
    • discord
    • git, git, git, git, git, git
    • hf
    • medium
    • yt, yt, yt, yt, yt
    Open In Colab 11.01.2024
    Big Vision This codebase is designed for training large-scale vision models using Cloud TPU VMs or GPU machines
    • arxiv, arxiv, arxiv, arxiv, arxiv, arxiv, arxiv, arxiv, arxiv, arxiv, arxiv
    • tf, tf
    Open In Colab 03.01.2024
    Open Interpreter An open-source, locally running implementation of OpenAI's Code Interpreter Killian Lucas Open In Colab 03.01.2024
    Seamless Communication Family of AI models that enable more natural and authentic communication across languages Open In Colab 14.12.2023
    CleanRL Deep Reinforcement Learning library that provides high-quality single-file implementation with research-friendly features
    • arxiv, arxiv, arxiv, arxiv, arxiv, arxiv, arxiv
    • docs
    • git, git, git, git
    • hf
    • paper
    • yt, yt
    Open In Colab 28.11.2023
    Vocos Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis Hubert Siuzdak Open In Colab 21.11.2023
    X—LLM Easy LLM Finetuning using the most advanced methods Boris Zubarev
    • arxiv
    • discord
    • git, git, git
    • hf, hf
    • pypi
    Open In Colab 15.11.2023
    Distil-Whisper Maintains the robustness of the Whisper model to difficult acoustic conditions, while being less prone to hallucination errors on long-form audio
    • arxiv, arxiv
    • git, git
    • hf, hf, hf, hf, hf, hf, hf, hf
    • medium
    • reddit
    • yt, yt, yt
    Open In Colab 08.11.2023
    AnimateDiff Practical framework to animate most of the existing personalized text-to-image models once and for all, saving efforts in model-specific tuning Open In Colab 30.10.2023
    Intel® Neural Compressor Aims to provide popular model compression techniques such as quantization, pruning (sparsity), distillation, and neural architecture search on mainstream frameworks such as TensorFlow, PyTorch, ONNX Runtime, and MXNet, as well as Intel extensions such as Intel Extension for TensorFlow and Intel Extension for PyTorch intel
    • arxiv, arxiv, arxiv
    • discord
    • docs
    • git, git, git
    • medium, medium
    • neurips
    • pt
    • yt, yt, yt, yt, yt, yt
    Open In Colab 27.10.2023
    Bark Transformer-based text-to-audio model suno Open In Colab 25.10.2023
    Mistral Transformer The most powerful language model for its size to date Open In Colab 09.10.2023
    Fooocus Image generating software Lvmin Zhang
    • arxiv
    • yt, yt, yt, yt
    Open In Colab 03.10.2023
    Actor-Critic This tutorial demonstrates how to implement the Actor-Critic method using TensorFlow to train an agent on the Open AI Gym CartPole-V0 environment Open In Colab 28.09.2023
    LiteLLM Call all LLM APIs using the OpenAI format [Bedrock, Huggingface, VertexAI, TogetherAI, Azure, OpenAI, Groq etc.] Open In Colab 23.09.2023
    MMagic AIGC toolbox for professional AI researchers and machine learning engineers to explore image and video processing, editing and generation OpenMMLab
    • discord
    • docs
    • git, git, git, git
    • medium
    • twitter
    • yt
    Open In Colab 11.09.2023
    SeqIO Library for processing sequential data to be fed into downstream sequence models
    • arxiv, arxiv, arxiv, arxiv
    • docs
    • tf, tf, tf, tf, tf, tf
    Open In Colab 08.09.2023
    MMAction2 An open-source toolbox for video understanding based on PyTorch MMAction2 Contributors Open In Colab 06.09.2023
    Home Robot Low-level API for controlling various home robots Chris Paxton
    • git, git, git, git, git, git, git, git, git
    Open In Colab 30.08.2023
    Neural Tangents Library designed to enable research into infinite-width neural networks Open In Colab 29.08.2023
    Stable Diffusion 2 New stable diffusion model at 768x768 resolution. Same number of parameters in the U-Net as 1.5, but uses OpenCLIP-ViT/H as the text encoder and is trained from scratch
    • arxiv, arxiv, arxiv, arxiv, arxiv, arxiv
    • git, git, git, git, git
    • hf, hf, hf, hf
    • yt
    Open In Colab 26.08.2023
    DALL·E Mini Generate images from a text prompt Open In Colab 22.08.2023
    Kandinsky 2.1 As text and image encoder it uses CLIP model and diffusion image prior between latent spaces of CLIP modalities Open In Colab 07.08.2023
    Stable Diffusion web UI A web interface for Stable Diffusion, implemented using Gradio library AUTOMATIC1111
    • arxiv
    • git, git, git, git, git, git, git, git, git, git, git, git, git, git, git, git, git, git, git
    • hf
    • medium, medium, medium
    • reddit
    • yt, yt, yt
    Open In Colab 05.08.2023
    SoftVC VITS Singing Voice Conversion svc develop team
    • git, git, git, git, git, git
    • hf
    Open In Colab 31.07.2023
    threestudio Unified framework for 3D content creation from text prompts, single images, and few-shot images, by lifting 2D text-to-image generation models
    • arxiv, arxiv, arxiv
    • discord
    • git, git, git, git, git, git, git, git, git
    • hf, hf
    • reddit
    • yt
    Open In Colab 28.07.2023
    Image captioning Given an image our goal is to generate a caption Open In Colab 25.07.2023
    Word2Vec Word2Vec is not a singular algorithm, rather, it is a family of model architectures and optimizations that can be used to learn word embeddings from large datasets Open In Colab 25.07.2023
    Word embeddings This tutorial contains an introduction to word embeddings Billy Lamberta Open In Colab 25.07.2023
    Tortoise A multi-voice TTS system trained with an emphasis on quality James Betker Open In Colab 15.07.2023
    Petals Run 100B+ language models at home, BitTorrent-style BigScience Open In Colab 05.07.2023
    Epistemic Neural Networks A library for neural networks that know what they don't know
    • arxiv
    • medium
    • yt
    Open In Colab 27.06.2023
    DeepFloyd IF State-of-the-art open-source text-to-image model with a high degree of photorealism and language understanding Open In Colab 26.06.2023
    normflows PyTorch implementation of discrete normalizing flows
    • arxiv
    • docs
    • git, git
    • wiki
    Open In Colab 26.06.2023
    MMPose Toolbox for pose estimation based on PyTorch OpenMMLab
    • discord
    • docs
    • medium
    • pypi
    • twitter
    • yt, yt
    Open In Colab 19.06.2023
    MyoSuite A collection of musculoskeletal environments and tasks simulated with the MuJoCo physics engine and wrapped in the OpenAI gym API to enable the application of Machine Learning to bio-mechanic control problems
    • arxiv
    • docs
    Open In Colab 16.06.2023
    Audiocraft PyTorch library for deep learning research on audio generation Open In Colab 11.06.2023
    Detectron2 FAIR's next-generation platform for object detection and segmentation Yuxin Wu Open In Colab 26.05.2023
    Reverb Efficient and easy-to-use data storage and transport system designed for machine learning research
    • arxiv, arxiv, arxiv, arxiv, arxiv, arxiv, arxiv, arxiv
    • pypi
    • reddit
    Open In Colab 23.05.2023
    MMDetection Open source object detection toolbox based on PyTorch OpenMMLab
    • arxiv, arxiv, arxiv
    • discord
    • docs
    • git, git, git
    • medium
    • pwc, pwc, pwc
    • pypi
    • twitter
    • yt, yt, yt, yt, yt, yt
    Open In Colab 17.05.2023
    ChatRWKV Like ChatGPT but powered by RWKV (100% RNN) language model, which is the only RNN that can match transformers in quality and scaling, while being faster and saves VRAM Open In Colab 08.05.2023
    PyGlove General-purpose library for Python object manipulation
    • arxiv
    • docs
    • medium
    • neurips
    • pypi
    • reddit
    Open In Colab 06.05.2023
    Python Data Science Handbook Jupyter notebook version of the Python Data Science Handbook by Jake VanderPlas Jake Vanderplas Open In Colab 06.05.2023
    PGMax General factor graphs for discrete probabilistic graphical models, and hardware-accelerated differentiable loopy belief propagation in JAX
    • arxiv
    • wiki
    Open In Colab 05.05.2023
    StableLM Stability AI Language Models Stability AI Open In Colab 27.04.2023
    TTS A library for advanced Text-to-Speech generation, built on the latest research, was designed to achieve the best trade-off among ease-of-training, speed and quality Open In Colab 26.04.2023
    OpenCLIP An open source implementation of CLIP Open In Colab 16.04.2023
    Stable Baselines3 Set of reliable implementations of reinforcement learning algorithms in PyTorch Open In Colab 14.04.2023
    RL Baselines3 Zoo Training Framework for Stable Baselines3 Reinforcement Learning Agents Antonin Raffin
    • arxiv
    • docs
    • git, git, git
    • hf
    Open In Colab 14.04.2023
    Grounded-SAM Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything IDEA-Research
    • arxiv, arxiv
    • git, git, git, git, git, git, git, git, git, git, git, git
    • yt, yt, yt, yt
    Open In Colab 12.04.2023
    TFDS Collection of ready-to-use datasets for use with TensorFlow, Jax, and other Machine Learning frameworks Ryan Sepassi
    • medium
    • pypi
    • tf
    • yt, yt, yt, yt
    Open In Colab 11.04.2023
    MMSegmentation Open source semantic segmentation toolbox based on PyTorch OpenMMLab
    • discord
    • docs
    • medium, medium
    • pypi
    • twitter
    • yt
    Open In Colab 31.03.2023
    LAVIS Python deep learning library for LAnguage-and-VISion intelligence research and applications Open In Colab 24.03.2023
    pymdp Package for simulating Active Inference agents in Markov Decision Process environments
    • arxiv
    • docs
    Open In Colab 19.03.2023
    MMSelfSup Open-source framework for visual pre-training
    • discord
    • docs
    • git
    • medium
    • pypi
    • twitter
    • yt
    Open In Colab 14.03.2023
    Tzer Coverage-Guided Tensor Compiler Fuzzing with Joint IR-Pass Mutation
    • arxiv
    • docker
    • docs
    • git
    Open In Colab 09.03.2023
    ArtLine A Deep Learning based project for creating line art portraits Vijish Madhavan Open In Colab 03.03.2023
    AmpliGraph A suite of neural machine learning models for relational Learning, a branch of machine learning that deals with supervised learning on knowledge graphs
    • arxiv, arxiv, arxiv, arxiv, arxiv, arxiv
    • docs
    • neurips, neurips
    • yt
    Open In Colab 23.02.2023
    NMT with attention This notebook trains a seq2seq model for Spanish to English translation Open In Colab 15.02.2023
    GLUE using BERT on TPU This tutorial contains complete end-to-end code to train models on a TPU Anirudh Dubey Open In Colab 15.02.2023
    TensorBoard Suite of web applications for inspecting and understanding your TensorFlow runs and graphs Yuan Tang Open In Colab 10.02.2023
    High-performance Simulation with Kubernetes This tutorial will describe how to set up high-performance simulation using a TFF runtime running on Kubernetes Jason Roselander Open In Colab 31.01.2023
    Compel Text prompt weighting and blending library for transformers-type text embedding systems Damian Stewart
    • git
    • hf
    Open In Colab 26.01.2023
    DALL·E Flow An interactive workflow for generating high-definition images from text prompt
    • git, git
    • hf
    • yt, yt
    Open In Colab 26.01.2023
    Diffusers Provides pretrained diffusion models across multiple modalities, such as vision and audio, and serves as a modular toolbox for inference and training of diffusion models Hugging Face
    • arxiv, arxiv, arxiv, arxiv, arxiv
    • git, git, git, git
    • hf, hf, hf, hf, hf
    • medium
    • yt
    Open In Colab 17.01.2023
    Sample Factory One of the fastest RL libraries focused on very efficient synchronous and asynchronous implementations of policy gradients Open In Colab 17.01.2023
    Open-Assistant Chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so Open In Colab 14.01.2023
    panda-gym Set of robotic environments based on PyBullet physics engine and gymnasium
    • [<