cumulo-autumn / StreamDiffusion
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
See what the GitHub community is most excited about today.
Dev tool that writes scalable apps from scratch while the developer oversees the implementation
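Completely free and open source, built on the Requests module: a data collection tool for TikTok profiles/videos/image galleries/original sounds and Douyin profiles/videos/image galleries/favorites/livestreams/original sounds/collections/comments/accounts/search/trending lists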
Official implementation of "Splatter Image: Ultra-Fast Single-View 3D Reconstruction"
Free, lightweight, and open source: a tool built on the AIOHTTP module for collecting Xiaohongshu (小红书) image-text and video posts
A collective list of free APIs
Official implementation of the paper "AnyDoor: Zero-shot Object-level Image Customization"
Revolutionizing Database Interactions with Private LLM Technology
An open-source impl. of Large Reconstruction Models
Get a ChatGPT plugin up and running in under 5 minutes!
Instant voice cloning by MyShell
A sound cloning tool with a web interface, using your voice or any sound to record audio
⚡ Building applications with LLMs through composability ⚡
A PyTorch-based Speech Toolkit
Azur Lane bot (CN/EN/JP/TW) | Seamless commissions and research, fully automated Operation Siren (大世界)
The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"
🌟 The Multi-Agent Framework: Given one line Requirement, return PRD, Design, Tasks, Repo
A nearly-live implementation of OpenAI's Whisper.
Fast and memory-efficient exact attention
Specify what you want it to build, the AI asks for clarification, and then builds it.
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.