FELIX, a fast and flexible text-editing system that models large structural changes and achieves a 90x speed-up compared to seq2seq approaches whilst achieving impressive results on four monolingual generation tasks.
Compared to traditional seq2seq methods, FELIX has the following three key advantages:
Sample efficiency: Training a high precision text generation model typically requires large amounts of high-quality supervised data.
FELIX uses three techniques to minimize the amount of required data:
(1) fine-tuning pre-trained checkpoints, (2) a tagging model that learns a small number of edit operations, and (3) a text insertion task that is very similar to the pre-training task.
Fast inference time: FELIX is fully non-autoregressive, avoiding slow inference times caused by an autoregressive decoder.
Flexible text editing: FELIX strikes a balance between the complexity of learned edit operations and flexibility in the transformations it models
• • •
Missing some Tweet in this thread? You can try to
force a refresh
How do you create a beautiful interface for your machine learning or data science project?
Handmade from scratch?
Any good tools?
Sure there are incredible tools:
Beautiful ML & DS interfaces
Gradio
Quickly create customizable UI components around your ML models. By dragging-and-dropping in your own images, pasting your own text, recording your own voice & seeing what the model outputs.
Dash apps bring Python analytics to everyone with a point-&-click interface to models written in Python, R & Julia - vastly expanding the notion of what's possible in a traditional dashboard.
Learn more about spaCy v3.0 and its new features like: transformer-based pipelines, the new training config and workflow system to help you take projects from prototype to production.
STEP BY STEP
↓ 2/4
01:54 – State-of-the-art transformer-based pipelines
05:03 – Declarative configuration system
11:06 – Workflows for end-to-end projects
17:03 – Trainable and rule-based components
21:43 – Custom models in any framework
26:20 – Features and summary
They first introduce an optimization scheme that utilizes a CLIP-based loss to modify an input latent vector in response to a user-provided text prompt.
Next, they describe a latent mapper that infers a text-guided latent manipulation step for a given input image, allowing faster and more stable textbased manipulation.
The past decade has witnessed a groundbreaking rise of machine learning for human language analysis, with current methods capable of automatically accurately recovering various aspects of syntax and semantics - including sentence
structure and grounded word meaning - from large data collections.
Recent research showed the promise of such tools for analyzing acoustic communication in nonhuman species.
They posit that machine learning will be the cornerstone of future collection, processing, and analysis of multimodal streams of data in animal communication studies, including bioacoustic, behavioral, biological, and environmental data.