Hello Deep Learning: Dropout, data augmentation, weight decay and quantisation

This page is part of the Hello Deep Learning series of blog posts. You are very welcome to improve this page via GitHub! In the previous chapter we found ways to speed up our character recognition learning by a factor of 20 by using a better optimizer, and by a further factor of four by cleverly using threads in a 'shared-nothing architecture'...