Skip to main navigation Skip to search Skip to main content

Teddy: Efficient Large-Scale Dataset Distillation via Taylor-Approximated Matching

Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

Abstract

Dataset distillation or condensation refers to compressing a large-scale dataset into a much smaller one, enabling models trained on this synthetic dataset to generalize effectively on real data. Tackling this challenge, as defined, relies on a bi-level optimization algorithm: a novel model is trained in each iteration within a nested loop, with gradients propagated through an unrolled computation graph. However, this approach incurs high memory and time complexity, posing difficulties in scaling up to large datasets such as ImageNet. Addressing these concerns, this paper introduces Teddy, a Taylor-approximated dataset distillation framework designed to handle large-scale dataset and enhance efficiency. On the one hand, backed up by theoretical analysis, we propose a memory-efficient approximation derived from Taylor expansion, which transforms the original form dependent on multi-step gradients to a first-order one. On the other hand, rather than repeatedly training a novel model in each iteration, we unveil that employing a pre-cached pool of weak models, which can be generated from a single base model, enhances both time efficiency and performance concurrently, particularly when dealing with large-scale datasets. Extensive experiments demonstrate that the proposed Teddy attains state-of-the-art efficiency and performance on the Tiny-ImageNet and original-sized ImageNet-1K dataset, notably surpassing prior methods by up to 12.8%, while reducing 46.6% runtime. Our code will be available at https://github.com/Lexie-YU/Teddy.

Original languageEnglish
Title of host publicationComputer Vision – ECCV 2024 - 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part XLVI
EditorsAleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol
Place of PublicationCham Switzerland
PublisherSpringer
Pages1-17
Number of pages17
ISBN (Electronic)9783031729522
ISBN (Print)9783031729515
DOIs
Publication statusPublished - 2025
Externally publishedYes
EventEuropean Conference on Computer Vision 2024 - Milan, Italy
Duration: 29 Sept 20244 Oct 2024
Conference number: 18th
https://eccv2024.ecva.net/Conferences/2024/Dates
http://chrome-extension://efaidnbmnnnibpcajpcglclefindmkaj/https://media.eventhosts.cc/Conferences/ECCV2024/ConferenceProgram.pdf (Proceedings)

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
PublisherSpringer
Volume15104 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferenceEuropean Conference on Computer Vision 2024
Abbreviated titleECCV 2024
Country/TerritoryItaly
CityMilan
Period29/09/244/10/24
Internet address

Keywords

  • Dataset distillation
  • Efficient training
  • Taylor approximation

Cite this