Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the future state of the world in the form of video.
-
Updated
May 4, 2026 - Python
Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the future state of the world in the form of video.
MOMENT: A Family of Open Time-series Foundation Models, ICML'24
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]
A curated list of academic papers and resources on Physical AI — focusing on Vision-Language-Action (VLA) models, world models, embodied ai, and robotic foundation models.
Official repository for "CLIP model is an Efficient Continual Learner".
Google Cloud Medical Imaging ML Development Accelerators
ACM Multimedia 2023 (Oral) - RTQ: Rethinking Video-language Understanding Based on Image-text Model
In this course navigates through the LLMOps pipeline, enabling you to preprocess training data for supervised fine-tuning and deploy custom Large Language Models (LLMs).
This repository is for profiling, extracting, visualizing and reusing generative AI weights to hopefully build more accurate AI models and audit/scan weights at rest to identify knowledge domains for risk(s).
🐍📦 High-performance cosine similarity ranking for Retrieval-Augmented Generation (RAG) pipelines.
PyTorch-native framework for fault-tolerant distributed training of foundation models on AI clusters
Glycan Informed Foundational Framework for Learning Abstract Representations, based on Combinatorial Complexes and Heterogeneous GNNs
Go library for unified access to foundational AI models
High-performance LLM inference engine written in C++ & CUDA from scratch. No PyTorch. Custom memory paging, fused kernels, and INT4 quantization.
Course assignments of COL828:- Advanced Computer Vision course at IIT Delhi under Professor Chetan Arora
An improved temporal data pipeline with foundational model for battery State of Health (SOH) prediction (R²->0.99) using advanced time series decomposition (D3R, CEEMDAN) and transformer-based methods. Utilized 100-150 features (ARIMA-based, Rolling statistics, Degradation indicators)
On-device App Store Optimization copilot for indie macOS developers. Swift, SwiftUI, Foundation Models
🔋 Predict battery State of Health (SOH) with a cutting-edge model using time series decomposition and transformer methods for accurate results.
Computationally efficient multi-scale foundation model for computational pathology. 71M parameters, runs end-to-end on a single consumer GPU.
Add a description, image, and links to the foundational-models topic page so that developers can more easily learn about it.
To associate your repository with the foundational-models topic, visit your repo's landing page and select "manage topics."