Skip to content
View aditya10's full-sized avatar

Highlights

  • Pro

Organizations

@CPSC-436I-Project @TIBET-AI

Block or report aditya10

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Refine high-quality datasets and visual AI models

Python 9,119 590 Updated Jan 30, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,236 2,339 Updated Aug 12, 2024

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,537 2,926 Updated Sep 2, 2024

Official implementation for "Break-A-Scene: Extracting Multiple Concepts from a Single Image" [SIGGRAPH Asia 2023]

Python 510 24 Updated Jan 14, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 10,195 991 Updated Nov 18, 2024

Torch implementation of Soft-DTW, supports CUDA.

Python 36 2 Updated Feb 24, 2023

[ECCV 2022] Tensorial Radiance Fields, a novel approach to model and reconstruct radiance fields

Python 1,195 153 Updated Sep 27, 2023

Stable Diffusion with Core ML on Apple Silicon

Python 17,104 964 Updated Jan 23, 2025

FastAPI framework, high performance, easy to learn, fast to code, ready for production

Python 80,353 6,873 Updated Jan 30, 2025

Official repository for the A-OKVQA dataset

Python 71 8 Updated May 8, 2024

Recent Transformer-based CV and related works.

1,328 143 Updated Aug 22, 2023

Official Implementation for paper "Referring Transformer: A One-step Approach to Multi-task Visual Grounding" Neurips 2021

Python 66 1 Updated May 26, 2022

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Python 31,103 7,569 Updated Jan 14, 2025

X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsens…

Python 967 105 Updated Feb 27, 2023

🗑️ Cleanup script for macOS (DEPRECATED)

Shell 2,656 249 Updated May 21, 2023

Oscar and VinVL

Python 1,040 251 Updated Aug 28, 2023

Mac Media Keys for the Masses

Objective-C 2,836 277 Updated May 12, 2021

Dataset API for "PhraseCut: Language-based Image Segmentation in the Wild"

Jupyter Notebook 109 12 Updated May 13, 2020

Code for Referring Image Segmentation via Cross-Modal Progressive Comprehension, CVPR2020.

Python 60 16 Updated Feb 2, 2021

Multi Task Vision and Language

Jupyter Notebook 803 179 Updated Feb 16, 2022

[CVPR2020] Multi-task Collaborative Network for Joint Referring Expression Comprehension and Segmentation, CVPR2020 (oral)

Python 137 25 Updated Aug 4, 2022

Referring Expression Object Segmentation with Caption-Aware Consistency, BMVC 2019

Python 30 8 Updated Apr 21, 2021

MAttNet: Modular Attention Network for Referring Expression Comprehension

Jupyter Notebook 293 75 Updated Nov 29, 2022

📚 A collection of papers about Referring Image Segmentation.

663 58 Updated Nov 11, 2024

Machine Learning algorithm implementations from scratch.

Python 1,376 543 Updated Feb 1, 2024

Animation engine for explanatory math videos

Python 74,552 6,495 Updated Jan 8, 2025

The example project of inferencing Semantic Segementation using Core ML

Swift 328 32 Updated Mar 27, 2021

A lightweight, loosely coupled, Model-View-Controller framework to aid in REDCap plugin development.

PHP 2 1 Updated Sep 13, 2017
Next