Distilling knowledge from Neural Networks to build smaller and faster models

This article discusses GPT-2 and BERT models, as well as using knowledge distillation to create highly accurate models with fewer parameters than their teachers.