Skip to content

Popular repositories Loading

  1. Awesome-Video-Diffusion Awesome-Video-Diffusion Public

    A curated list of recent diffusion models for video generation, editing, and various other applications.

    4.8k 299

  2. Tune-A-Video Tune-A-Video Public

    [ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

    Python 4.3k 393

  3. computer_use_ootb computer_use_ootb Public

    Out-of-the-box (OOTB) GUI Agent for Windows and macOS

    Python 1.7k 165

  4. Show-o Show-o Public

    [ICLR 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

    Python 1.6k 72

  5. ShowUI ShowUI Public

    [CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.

    Python 1.4k 100

  6. Show-1 Show-1 Public

    [IJCV] Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

    Python 1.1k 57

Repositories

Showing 10 of 99 repositories
  • Multi-human-Talking-Video-Dataset Public

    Official repository for Muti-human Interactive Talking Dataset

    showlab/Multi-human-Talking-Video-Dataset’s past year of commit activity
    Python 16 0 1 0 Updated Aug 6, 2025
  • Awesome-Video-Diffusion Public

    A curated list of recent diffusion models for video generation, editing, and various other applications.

    showlab/Awesome-Video-Diffusion’s past year of commit activity
    4,836 299 0 0 Updated Aug 4, 2025
  • TrustScorer Public

    ACM MM 2025 Can I Trust You? Advancing GUI Task Automation with Action Trust Score

    showlab/TrustScorer’s past year of commit activity
    2 MIT 0 0 0 Updated Aug 3, 2025
  • Awesome-MLLM-Hallucination Public

    đź“– A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).

    showlab/Awesome-MLLM-Hallucination’s past year of commit activity
    787 33 1 0 Updated Aug 1, 2025
  • Awesome-Unified-Multimodal-Models Public

    đź“– This is a repository for organizing papers, codes and other resources related to unified multimodal models.

    showlab/Awesome-Unified-Multimodal-Models’s past year of commit activity
    653 35 2 0 Updated Aug 1, 2025
  • Show-o Public

    [ICLR 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

    showlab/Show-o’s past year of commit activity
    Python 1,633 Apache-2.0 72 51 2 Updated Jul 31, 2025
  • WorldGUI Public

    Enable AI to control your PC. This repo includes the WorldGUI Benchmark and GUI-Thinker Agent Framework.

    showlab/WorldGUI’s past year of commit activity
    Python 90 7 1 0 Updated Jul 27, 2025
  • Impossible-Videos Public

    ICML 2025 - Impossible Videos

    showlab/Impossible-Videos’s past year of commit activity
    Python 72 6 1 0 Updated Jul 23, 2025
  • SMS Public

    [ICCV 2025] Balanced Image Stylization with Style Matching Score

    showlab/SMS’s past year of commit activity
    Python 58 MIT 2 0 0 Updated Jul 20, 2025
  • livecc Public

    LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025)

    showlab/livecc’s past year of commit activity
    Python 253 36 4 1 Updated Jul 18, 2025

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…