Congratulations to AUTOMATIC1111: A Stable Diffusion Web-UI built using @Gradio for 100k @github stars! 🎉🥳
🖼️Automatic1111 is the Ultimate Creative Playground!🚀
Let's explore in this thread what all is possible with this feature-packed Web UI 👇🧵
1️⃣ One-click install & run script for easy setup📥
2️⃣ Outpainting, inpainting, color sketching – unleash your artistic vision🎨
3️⃣ Control attention with ((image-element)) format for focus👁️
4️⃣ Loopback & X/Y/Z plots for dynamic image transformations📊
5️⃣ Generate variations🔄
🌟@Gradio's AUTO1111 is endlessly creative:
1️⃣Tweak defaults with text config & explore advanced sampling options🎛️
2️⃣Save styles (prompts) & apply them easily for consistent looks✏️
3️⃣Edit prompts on the fly for spontaneous transformations🔄
4️⃣Batch processing with img2img✨📊
🚀With AUTO1111's, the AI-powered features make your imagination run wild:
🎉 Revolutionize your creative process with AUTOMATIC1111 @Gradio web-ui:
1️⃣Dive into diverse image modes like outpainting & inpainting🎨🖌️
2️⃣Fine-tune attention with prompts & focus on specific details 👀🔍
3️⃣ Generate, tweak, and explore with intuitive UI elements 🔄🔆
🔥Create art that defies limits – @Gradio build AUTOMATIC1111 is your canvas! 🎨:
1️⃣Discover original txt2img & img2img modes for diverse outputs🖼️
2️⃣Generate, adjust, and experiment effortlessly with the UI
3️⃣Seamlessly merge checkpoints, access deepdanbooru integration & more!
There are many more features and applications of AUTOMATIC1111 @Gradio Web-UI, which are both fascinating and helpful to artists and hobbyists alike!!
🔥WILD! @AnthropicAI released Citations API for Claude last week -- a game-changing feature that will allow you to build trustworthy AI systems. Here's why this is HUGE... 🧵
1/ Build reliable RAG systems. AI should prove its work, not just give "trust me bro" responses. Anthropic's Citations API solves this elegantly: feed docs to Claude, enable citations, and get perfect source references! 🎯
@AnthropicAI 2/ What makes Citations special:
- Zero prompt engineering needed
- Automatic text chunking
- Works with PDFs & plain text
- Precise page/character citations
- Super efficient token usage
- Built for streaming
- No more complex citation systems! 🚀
@AnthropicAI 3/ The BEST part about Anthropic's Citations?
💁♀️ Citations are separate from response text, making verification dead simple.
🤑 And the cited text doesn't count towards your output tokens! Works with Claude 3.5 Sonnet & Haiku rn.
🆕 SwiftEdit: Lightning Fast Text-guided Image Editing via One-step Diffusion
Make your edit in just 0.23 seconds !!! 🤯
User-friendly, lightning-fast editing gradio-app that enables instant edits through simple prompts, delivering precise results.👇 Learn more about the project
SwiftEdit
> You can edit facial Identity and expressions via flexible text prompts
SwiftEdit for Instant text-guided image editing (in 0.23s)!
Novel contributions:
> One-step inversion framework that enables one-step image reconstruction
> Mask-guided editing technique to perform localized image editing.
We’ve been hard at work over the past few months, and we are excited to announce today the stable release of Gradio 5!
With more than 2 million users every month (and >470,000 apps on Hugging Face Spaces), Gradio has become the default way to build, share, and use machine learning applications.
Our goal with Gradio 5 was to address the most common pain points that we’ve heard from Gradio developers about taking these apps to production. Here are 5 new things in Gradio 5 (including a new way to build Gradio apps without writing any code!)
Does Gradio load too slowly for you?
Well, not anymore! Gradio 5 ships with major performance improvements, including the ability to serve apps via server-side rendering (SSR) which loads Gradio apps almost instantaneously in the browser. No more loading spinner! 🏎️💨
Have you ever felt that Gradio apps looks old-school?
We're refreshing many of the core Gradio components, including Buttons, Tabs, Sliders, as well as the high-level chatbot interface, with a modern design in Gradio 5. We’re also releasing a new set of built-in themes, like "citrus" and "ocean", to let you easily create fresh-looking Gradio apps 👨🎨
🔥 🔥 Generate 3D characters with CharacterGen (SIGGRAPH'24)
- Generated 3D characters have high-quality shapes and textures
- Useful for downstream applications such as animation and game dev
- Code, demo, and pretrained weights available now on 🤗
🚀 Generate 3D characters with CharacterGen (SIGGRAPH'24)
🚀Introducing LLaVA-NeXT Interleave: Now AI can understand and reason with multiple images at once
- This opens up multi-image scenarios like multi-frame videos, multi-view 3D, and multiple inter-leaved images.
- An all round LMM that can understand videos, images, and 3D
More⬇️
LLaVA-NeXT-Interleave🔥
- Interleave data format unifies different tasks.
- New datasets on 🤗Hub:
1️⃣M4-Instruct, high-quality dataset, 1.1M samples from domains: multi-image, video, 3D & single-image
2️⃣LLaVA-Interleave Bench - Set of tasks to evaluate multi-image capabilities
LLaVA-NeXT-Interleave💪
- Attached videos show how it can explain jokes and understand content spread in multiple images and videos 🤯
- SoTA Performance, both, in multi and single images
- Matches in perf with LLaVA-NeXT
- Improved performance in video tasks