OpenVision (ICCV 2025), OpenVision 2 (CVPR 2026), and OpenVision 3
-
Updated
Feb 21, 2026 - Python
OpenVision (ICCV 2025), OpenVision 2 (CVPR 2026), and OpenVision 3
A curated collection of resources focused on the Mechanistic Interpretability (MI) of Large Multimodal Models (LMMs). This repository aggregates surveys, blog posts, and research papers that explore how LMMs represent, transform, and align multimodal information internally.
Recognize Any Regions
[--branch main] FSFM: A Generalizable Face Security Foundation Model via Self-Supervised Facial Representation Learning (CVPR25) [--branch FSVFM-extension] Scalable Face Security Vision Foundation Model for Deepfake, Diffusion, and Spoofing Detection (Extented Version)
[ICLR 2026] FSOD-VFM: Few-Shot Object Detection with Vision Foundation Models and Graph Diffusion
A Vision Foundation Model for Cine Cardiac Magnetic Resonance Imaging
One-Shot Open Affordance Learning with Foundation Models (CVPR 2024)
Official repository of "CoMP: Continual Multimodal Pre-training for Vision Foundation Models"
MonoDINO-DETR: Depth-Enhanced Monocular 3D Object Detection Using a Vision Foundation Model
This repo collects some latest research work of Generative AI. It provides simple implementations to understand the ideas and some follow-up discussions to inspire future work.
Implementation of CAST: Contrastive Adaptation and Distillation for Semi-Supervised Instance Segmentation.
"Boosting Gaze Object Prediction via Pixel-level Supervision from Vision Foundation Model"
Codebase for probing VFMs and Feature Upsamplers using Intractive Segmentation.
A Synthetic Benchmark for Evaluating Spatial Intelligence in Visual Foundation Models
Simple Gradio application integrated with Hugging Face Multimodals to support visual question answering chatbot and more features
A Training Objective for Interpretable Monosemantic Representations
Florence-2 quick test
ProRobo3D Benchmark to be release...
Add a description, image, and links to the vision-foundation-model topic page so that developers can more easily learn about it.
To associate your repository with the vision-foundation-model topic, visit your repo's landing page and select "manage topics."