
Madelon Hulsebos | GitTables: A Large-Scale Corpus of Relational Tables | #36
Disseminate: The Computer Science Research Podcast
Episode · 0 Play
Episode · 45:54 · Jul 17, 2023
About
Summary:The success of deep learning has sparked interest in improving relational table tasks, like data preparation and search, with table representation models trained on large table corpora. Existing table corpora primarily contain tables extracted from HTML pages, limiting the capability to represent offline database tables. To train and evaluate high-capacity models for applications beyond the Web, we need resources with tables that resemble relational database tables. In this episode, Madelon Hulsebos tells us all about such a resource! Tune in to learn more about GitTables!! Links: Madelon's websiteGitTables homepageSIGMOD'23 paperBuy Me A Coffee! Hosted on Acast. See acast.com/privacy for more information.
45m 54s · Jul 17, 2023
© 2023 Acast AB (OG)