Workshop
Workshop on Weakly Supervised Learning
Benjamin Roth · Barbara Plank · Alex Ratner · Katharina Kann · Dietrich Klakow · Michael Hedderich
Fri 7 May, 7 a.m. PDT
Deep learning relies on massive training sets of labeled examples to learn from - often tens of thousands to millions to reach peak predictive performance. However, large amounts of training data are only available for very few standardized learning problems. Even small variations of the problem specification or changes in the data distribution would necessitate re-annotation of large amounts of data.
However, domain knowledge can often be expressed by sets of prototypical descriptions. These knowledge-based descriptions can be either used as rule-based predictors or as labeling functions for providing partial data annotations. The growing field of weak supervision provides methods for refining and generalizing such heuristic-based annotations in interaction with deep neural networks and large amounts of unannotated data.
In this workshop, we want to advance theory, methods and tools for allowing experts to express prior coded knowledge for automatic data annotations that can be used to train arbitrary deep neural networks for prediction. Learning with weak supervision is both studied from a theoretical perspective as well as applied to a variety of tasks from areas like natural language processing and computer vision. This workshop aims at bringing together researchers from this wide range of fields to facilitate discussions across research areas that share the common ground of using weak supervision. A target of this workshop is also to inspire applications of weak supervision to new scenarios and to enable researchers to work on tasks that so far have been considered too low-resource.
As weak supervision addresses one of the major issues of current machine learning techniques, the lack of labeled data, it has also started to obtain commercial interest. This workshop is an opportunity to bridge innovations from academia and the requirements of industry settings.
Schedule
Fri 7:00 a.m. - 7:10 a.m.
|
Introduction and Opening Remarks
(
Introduction
)
>
|
🔗 |
Fri 7:10 a.m. - 8:10 a.m.
|
Invited Speaker Dan Roth - Natural Language Understanding with Incidental Supervision
(
Keynote Talk
)
>
SlidesLive Video |
Dan Roth 🔗 |
Fri 8:10 a.m. - 8:25 a.m.
|
Invited Talk Dan Roth - Q&A
(
Q&A
)
>
|
🔗 |
Fri 8:25 a.m. - 9:10 a.m.
|
Invited Speaker Marine Carpuat - Weak Supervision for Cross-Lingual Semantic Analysis
(
Keynote Talk
)
>
SlidesLive Video |
Marine Carpuat 🔗 |
Fri 9:10 a.m. - 9:25 a.m.
|
Invited Speaker Marine Carpuat - Q&A
(
Q&A
)
>
|
🔗 |
Fri 9:25 a.m. - 9:40 a.m.
|
Dependency Structure Misspecification in Multi-Source Weak Supervision Models
(
Contributed Talk
)
>
SlidesLive Video |
Salva Rühling Cachay 🔗 |
Fri 9:40 a.m. - 9:50 a.m.
|
Dependency Structure Misspecification in Multi-Source Weak Supervision Models - Q&A
(
Q&A
)
>
|
🔗 |
Fri 9:50 a.m. - 9:53 a.m.
|
AutoTriggER: Named Entity Recognition with Auxiliary Trigger Extraction
(
Poster Spotlight
)
>
SlidesLive Video |
Dong-Ho Lee 🔗 |
Fri 9:53 a.m. - 9:56 a.m.
|
Handling Long-Tail Queries with Slice-Aware Conversational Systems
(
Poster Spotlight
)
>
SlidesLive Video |
Cheng Wang 🔗 |
Fri 9:56 a.m. - 9:59 a.m.
|
Tabular Data Modeling via Contextual Embeddings
(
Poster Spotlight
)
>
SlidesLive Video |
Xin Huang 🔗 |
Fri 9:59 a.m. - 10:02 a.m.
|
TADPOLE: Task ADapted Pre-training via anOmaLy dEtection
(
Poster Spotlight
)
>
SlidesLive Video |
Vivek Madan 🔗 |
Fri 10:02 a.m. - 10:05 a.m.
|
Active WeaSuL: Improving Weak Supervision with Active Learning
(
Poster Spotlight
)
>
SlidesLive Video |
Samantha Biegel 🔗 |
Fri 10:05 a.m. - 10:08 a.m.
|
Transformer Language Models as Universal Computation Engines
(
Poster Spotlight
)
>
SlidesLive Video |
Kevin Lu 🔗 |
Fri 10:15 a.m. - 11:15 a.m.
|
Poster Session 1 ( Poster Session ) > link | 🔗 |
Fri 11:15 a.m. - 11:20 a.m.
|
Welcome Back
(
Introduction
)
>
|
🔗 |
Fri 11:20 a.m. - 11:50 a.m.
|
Invited Speaker Heng Ji - InfoSurgeon: Cross-media Weak Supervision for Knowledge-Element Level Fake News Detection
(
Keynote Talk
)
>
SlidesLive Video |
Heng Ji 🔗 |
Fri 11:50 a.m. - 12:05 p.m.
|
Invited Speaker Heng Ji - Q&A
(
Q&A
)
>
|
🔗 |
Fri 12:05 p.m. - 12:20 p.m.
|
Weakly Supervised Multi-task Learning for Concept-based Explainability
(
Contributed Talk
)
>
SlidesLive Video |
Vladimir Balayan 🔗 |
Fri 12:20 p.m. - 12:30 p.m.
|
Weakly Supervised Multi-task Learning for Concept-based Explainability - Q&A
(
Q&A
)
>
|
🔗 |
Fri 12:30 p.m. - 12:45 p.m.
|
Better Adaptation to Distribution Shifts with Robust Pseudo-Labeling
(
Contributed Talk
)
>
SlidesLive Video |
Evgenia Rusak 🔗 |
Fri 12:45 p.m. - 12:55 p.m.
|
Better Adaptation to Distribution Shifts with Robust Pseudo-Labeling - Q&A
(
Q&A
)
>
|
🔗 |
Fri 12:55 p.m. - 12:58 p.m.
|
Using system context information to complement weakly labeled data
(
Poster Spotlight
)
>
SlidesLive Video |
Matthias Meyer 🔗 |
Fri 12:58 p.m. - 1:01 p.m.
|
CIGMO: Learning categorical invariant deep generative models from grouped data
(
Poster Spotlight
)
>
SlidesLive Video |
Haruo Hosoya 🔗 |
Fri 1:01 p.m. - 1:04 p.m.
|
Pre-Training by Completing Points Cloud
(
Poster Spotlight
)
>
SlidesLive Video |
Hanchen Wang 🔗 |
Fri 1:04 p.m. - 1:07 p.m.
|
Weakly-Supervised Group Disentanglement using Total Correlation
(
Poster Spotlight
)
>
SlidesLive Video |
Linh Tran 🔗 |
Fri 1:07 p.m. - 1:10 p.m.
|
Is Disentanglement all you need? Comparing Concept-based & Disentanglement Approaches
(
Poster Spotlight
)
>
SlidesLive Video |
Dmitry Kazhdan 🔗 |
Fri 1:07 p.m. - 1:10 p.m.
|
Pervasive Label Errors in Test Sets Destabilize Machine Learning Benchmarks
(
Poster Spotlight
)
>
SlidesLive Video |
Curtis G Northcutt 🔗 |
Fri 1:20 p.m. - 2:20 p.m.
|
Poster Session 2 ( Poster Session ) > link | 🔗 |
Fri 2:20 p.m. - 2:25 p.m.
|
Welcome Back
(
Introduction
)
>
|
🔗 |
Fri 2:25 p.m. - 3:10 p.m.
|
Invited Speaker Lu Jiang - Robust Deep Learning and Applications
(
Keynote Talk
)
>
SlidesLive Video |
Lu Jiang 🔗 |
Fri 3:10 p.m. - 3:25 p.m.
|
Invited Speaker Lu Jiang - Q&A
(
Q&A
)
>
|
🔗 |
Fri 3:25 p.m. - 3:55 p.m.
|
Invited Speaker Paroma Varma - Snorkel: Programmatically Labeling Training Data
(
Keynote Talk
)
>
SlidesLive Video |
Paroma Varma 🔗 |
Fri 3:55 p.m. - 4:10 p.m.
|
Invited Speaker Paroma Varma - Q&A
(
Q&A
)
>
|
🔗 |
Fri 4:10 p.m. - 5:10 p.m.
|
Panel Discussion
(
Panel
)
>
|
🔗 |
Fri 5:10 p.m. - 5:25 p.m.
|
Concluding Remarks
(
Conclusion
)
>
|
🔗 |
Fri 5:25 p.m. - 6:00 p.m.
|
Post Workshop Hangout ( Gather.Town ) > link | 🔗 |