NTK reparametrization

From the blog Every Man a Debtor.
I’ve been learning about the neural tangent kernel, and some things confused me. In the NTK paper, the network layers have the form \(\frac{1}{\sqrt{n_A}}A\), where \(n_A\) is the number of neurons in \(A\). Why the square root? So I worked it out.

Setup

Input: \(x: {}^*\mathbb{R}_{\lim}\), where \({}^*\mathbb{R}_{\lim}\) means a limited (hyper)real...
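Before working through the derivation, the role of the \(\frac{1}{\sqrt{n_A}}\) factor can be checked numerically: with i.i.d. standard-normal weights, each preactivation is a sum of \(n_A\) independent terms, so its standard deviation grows like \(\sqrt{n_A}\) unless we divide by \(\sqrt{n_A}\). A minimal NumPy sketch (the function names here are mine, not from the post):

```python
import numpy as np

rng = np.random.default_rng(0)

def preactivation_std(n, scale):
    # x and A have i.i.d. N(0, 1) entries, as in the standard NTK setup.
    x = rng.standard_normal(n)
    A = rng.standard_normal((n, n))
    # Each component of A @ x is a sum of n i.i.d. terms, so it has
    # standard deviation sqrt(n); the scale factor can cancel that.
    return np.std(scale(n) * (A @ x))

for n in [100, 1_000, 10_000]:
    unscaled = preactivation_std(n, lambda n: 1.0)             # grows like sqrt(n)
    ntk = preactivation_std(n, lambda n: 1.0 / np.sqrt(n))     # stays O(1)
    print(f"n={n:>6}  unscaled≈{unscaled:7.2f}  1/sqrt(n)-scaled≈{ntk:5.2f}")
```

With the \(1/\sqrt{n}\) scaling the preactivations stay order 1 as the width grows, which is what makes the infinite-width limit well-defined.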