Writing code for a computer is hard enough. You take something big and fuzzy, some large vague business outcome you want to achieve. Then you break it down recursively and think through all the cases until you have clear logical statements a computer can follow.
As I am en route to see my first total solar eclipse, I was curious how hard it would be to compute eclipses in Python. It turns out, ignoring some minor coordinate system head-banging, I was able to get something half-decent working in a couple of hours.
The CIA produced a fantastic book during the peak of World War 2 called Simple Sabotage. It laid out various ways for infiltrators to ruin a company's productivity. Some of the advice is timeless, for instance the section about “General Interference with Organizations and Production”:
Long story short: I'm working on a super cool tool called Modal. Please check it out — it lets you run things in the cloud without having to think about infrastructure. Scaling out, scheduling, containerization, using GPUs, setting up webhooks, and all kinds of other stuff.
This is in many respects a successor to a blog post I wrote last year about what I want from software infrastructure, but the ideas have morphed in my head into something somewhat broader.
Hi! It's your friendly project management theoretician. You might remember me from blog posts such as Why software projects take longer than you think, a post I wrote a long time ago positing that software project completion times follow a log-normal distribution.
Here's a theory I have about cloud vendors (AWS, Azure, GCP): cloud vendors will increasingly focus on the lowest layers in the stack: basically leasing capacity in their data centers through an API. Other pure-software providers will build all the stuff on top of it.
This isn't as much of a blog post as an elaboration of a tweet I posted the other day: I think this specialization of data teams into 99 different roles (data scientist, data engineer, analytics engineer, ML engineer etc) is generally a bad thing driven by the fact that tools are bad and too hard to use
I guess I should really call this a parable. The backdrop is: you have been brought in to grow a tiny data team (~4 people) at a mid-stage startup (~$10M annual revenue), although this story could take place at many different types of companies.
Software infrastructure (by which I include everything ending with *aaS, or anything remotely similar to it) is an exciting field, in particular because (despite what the neo-luddites may say) it keeps getting better every year! I love working with something that moves so quickly.
I joined Better in early 2015 because I thought the team was crazy enough to actually change one of the largest industries in the US. For six years, I ran the tech team, hiring 300+ people, probably doing 2,000+ interviews, and according to GitHub I added 646,941 lines of code and removed 339,164.
It's a popular attitude among developers to rant about our tools and how broken things are. Maybe I'm an optimistic person, because my viewpoint is the complete opposite! I had my first job as a software engineer in 1999, and in the last two decades I've seen software engineering changing in ways that have made us orders of magnitude more...
I've spent a ton of time looking at different software providers, both as a CTO and as a nerdy “advanced” consumer who builds stuff in my spare time. In the last 10 years, there has been an order-of-magnitude increase in products that cater directly to developers, through APIs, SDKs, and tooling.
We live in a year of about 350,000 amateur epidemiologists and I have no desire to join that “club”. But I read something about COVID-19 deaths that I thought was interesting and wanted to see if I could replicate it through data.
Compensation has always been one of the most confusing parts of management to me. Getting it right is obviously extremely important. Compensation is what drives our entire economy, and you could look at the market for labor as one gigantic resource-allocating machine in the same way as people look at the stock market as a gigantic...
Hanlon's razor is a classic aphorism I'm sure you have heard before: Never attribute to malice that which can be adequately explained by stupidity. I've found that neither malice nor stupidity is the most common reason when you don't understand why something is a certain way.
Let's consider a toy model where you're hiring for two things and that those are equally valuable. It's not very important what those are, so let's just call them “thing A” and “thing B” for now.
I recently finished the excellent book Kochland. This isn't my first interest in Koch—I read The Science of Success by Charles Koch himself a couple of years ago. Charles Koch inherited a tiny company in 1967 and turned it into one of the world's largest.
Just a quick note that my team is always hiring at Better. A lot of new people have been joining the team here in NYC lately—the tech team has actually grown from 35 to 60 in just ~3 months.
My company has a buffet every Friday, and the lines grow to epic proportions when the food arrives. I've suspected for years that the “classic” buffet line system is a deeply flawed and inefficient method, and every time I'm stuck in the line has made me more convinced.
No one asked for this, but I'm something like ~12 years into my career and have had my fair share of mistakes and luck so I thought I'd share some. Honestly, I feel like I've mostly benefitted from luck.
This is a blog post originally featured on the Better engineering blog. If you want to link to this article or share it, please go to the original post URL! Separately, I'm sorry it's been so long with no posts on this blog.
Anyone who has built software for a while knows that estimating how long something is going to take is hard. It's hard to come up with an unbiased estimate of how long something will take when, fundamentally, the work itself is about solving something you haven't solved before.
When I started building up a tech team for Better, I made a very conscious decision to pay at the high end to get people. I thought this made more sense: they cost a bit more to hire, but their output usually more than compensates for it.
A modern tech stack typically involves at least a frontend and backend but relatively quickly also grows to include a data platform. This typically grows out of the need for ad-hoc analysis and reporting but possibly evolves into a whole oil refinery of cronjobs, dashboards, bulk data copying, and much more.
It started with a tweet: New years resolution: every plot I make during 2018 will contain uncertainty estimates — Erik Bernhardsson (@bernhardsson) January 7, 2018 Why? Because I've been sitting in 100,000,000 meetings where people endlessly debate whether the monthly number of widgets is going up or down, or whether widget method X is more...
This is a bit of a rant but I really don't like software that invents its own query language. There's a trillion different ORMs out there. Another trillion databases with their own query language. Another trillion SaaS products where the only way to query is to learn some random query DSL they made up.
I get bored reading management books very easily and lately I've been reading about a wide range of almost arbitrary topics. One of the lenses I tend to read through is to see different management styles in different environments.
As some of you may know, one of my side interests is approximate nearest neighbor algorithms. I'm the author of Annoy, a library with 3,500+ stars on Github as of today. It offers fast approximate search for nearest neighbors with the additional benefit that you can load data super fast from disk using mmap.
Ok, so I have to preface this whole blog post with a few things: I really struggle with the term microservices. I can't put my finger on exactly why. Maybe because the term is hopelessly ill-defined, maybe because it's gotten picked up by the hype train.
I have done roughly 2,000 interviews in my life. When I started recruiting, I had so much confidence in my ability to assess people. Let me just throw a couple of algorithm questions at a candidate and then I'll tell you if they are good or not!
I've been reading up on operations research lately, including queueing theory. It started out as a way to understand the very complex mortgage process (I work at a mortgage startup) but it's turned into my little hammer and now I see nails everywhere.
I started writing this blog in late 2012, partly because I felt like it would help me improve my English and my writing skills, partly because I kept having a lot of random ideas in my head and I wanted to write them down somewhere.
UPDATE (2018-06-17): There is a later blog post with newer benchmarks! One of my super nerdy interests is approximate algorithms for nearest neighbors in high-dimensional spaces. The problem is simple. You have, say, 1M points in some high-dimensional space.
I'm interrupting the regular programming for a quick announcement: we're looking for data engineers at Better. You would be the first one to join and would work a lot directly with me. Some fun things you could work on (these are all projects I'm working on right now):
Turns out having a toddler isn't super compatible with reading. I used to read ~100 books/year as a teenager, but it has slowly deteriorated to maybe 20-30 books, at most. And I don't even finish all of them because life is too short!
I spent a few days during the holidays fixing up a bunch of semi-dormant open source projects and I have a couple of blog posts in the pipeline about various updates. First up, I made a number of fixes to Git of Theseus which is a tool (written in Python) that generates statistics about Git repositories.
I spent six years at a company that went from 50 people to 1500 and one contributing factor leading to my departure was that I went from a “maker” to a person stuck in meetings every day.
I had an interesting idea a few weeks ago, best explained through an example. Let's say you're running an e-commerce site (I kind of do) and you want to optimize the number of purchases. Let's also say we try to learn as much as we can from users, both using A/B tests but also using just basic slicing and dicing of the data.
I've been a bit bad at posting things with a regular cadence lately, partly because I'm trying to adjust to having a toddler, partly because the hunt for clicks has caused such a high bar for me that I feel like I have to post something Pulitzer-worthy.
There are often close relationships between top level business metrics. For instance, it's well known that retention has a super strong impact on the valuation of a subscription business. Or that the % of occupied seats is super important for an airline.
A funny thing about being a foreigner is how you realize people take broken things for granted. I'm going to go out on a limb here claiming that the US has a pretty dumb banking system.
Just for fun, I generated these graphs of the number of letters in the word for each number. I really spent about 10 minutes on this (ok…possibly also another 40 minutes tweaking the plots): More languages!
Here's a dumb but extremely accurate rule I'm postulating* for software engineering projects: *you need at least 3 examples before you solve the right problem*. This is what I've noticed: Don't factor out shared code between two classes.
I just bought Machine, Platform, Crowd: Harnessing Our Digital Future and discovered that it mentions my blog – in particular the post When machine learning matters. Ok, I lied a little bit. I didn't discover it serendipitously.
There are about 765 million blog posts about the diversity “memo” that leaked out of Google a couple of weeks ago. I think the case for any biological difference is pretty weak, and it bothers me when people refer to an “interest gap” as anything other than caused by the environment.
I just spent a few days in Italy, on the Ligurian coast. Even though we were on the west side of Italy, the Mediterranean sea was to the east, because the house was situated on a long bay.
I've written before about the importance of iterating quickly, but I didn't necessarily talk about concrete things you can do. While building up the tech team at Better, I've intentionally optimized for fast iteration speed above almost everything else.
Remember when everyone had a really ugly blog with a blogroll? Anyway, I just think the word is funny. I follow a few hundred blogs using Feedly and Reeder and have read a few hundred thousand blog posts over the last 10 years.
How hard can it be to compute conversion rate? Take the total number of users that converted and divide by the total number of users. Done. Except… it's a lot more complicated when you have any sort of significant time lag.
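To make the naive computation (and its bias) concrete, here is a toy sketch of mine, not the post's code:

```python
# Naive conversion rate: converted users / all users. The time lag bias:
# users who signed up recently haven't had time to convert yet, so
# recent cohorts drag the naive rate down.
users = [
    # (signup_date, converted_date or None)
    ("2024-01-05", "2024-02-20"),
    ("2024-03-14", None),   # never converted (so far)
    ("2024-06-01", None),   # signed up recently; too early to tell
]
naive_rate = sum(1 for _, converted in users if converted) / len(users)
print(naive_rate)  # 0.33, but the eventual rate may well be higher
```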
I've read about 100 management books by now but if there's something that always bothered me it's the lack of first principles thinking. Basically it's a ton of heuristics. And heuristics are great, but when you present heuristics as true objectives, it kind of clouds the underlying objectives (and you end up with weird proxy cults like the Agile...
I was reading yet another blog post titled “Why our team moved from X to Y” (I forgot which one) and I started wondering if you can generalize it a bit. Is it possible to generate an N × N contingency table of moving from language X to language Y?
I just realized last Thursday that I have spent two full years at Better, incidentally on the same day as we announced a $15M round led by Kleiner Perkins. So it was a good point to reflect a bit and think back – what the F led me to abandon my role managing the machine learning team at Spotify?
Here's a fun analysis that I did of the pitch (a.k.a. frequency) of various languages. Certain languages are simply pronounced with lower or higher pitch. Whether this is a feature of the language or more a cultural thing is a good question, but there are some substantial differences between languages.
This is a pretty dumb post, in which I argue that functional programming has a lot of the bad parts of libertarianism and a lot of the good parts: Both ideologies strive to eliminate [the] state.
As a project evolves, does the new code just add on top of the old code? Or does it replace the old code slowly over time? In order to understand this, I built a little thing to analyze Git projects, with help from the formidable GitPython project.
This blog post Data sets are the new server rooms makes the point that a bunch of companies raise a ton of money to go get really proprietary awesome data as a competitive moat. Because once you have the data, you can build a better product, and no one can copy it (at least not very cheaply).
Pareto efficiency is a useful concept I like to think about. It often comes up when you compare items on multiple dimensions. Say you want to buy a new TV. To simplify it let's assume you only care about two factors: price and quality.
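A quick sketch of finding the Pareto-efficient set in that two-dimensional setting (my illustration, not from the post): keep a TV unless some other TV dominates it, i.e. is both cheaper and at least as good.

```python
def pareto_efficient(tvs):
    # tvs: list of (price, quality); lower price and higher quality are better.
    def dominated(tv):
        p, q = tv
        return any(p2 <= p and q2 >= q and (p2, q2) != (p, q)
                   for (p2, q2) in tvs)
    return [tv for tv in tvs if not dominated(tv)]

print(pareto_efficient([(500, 7), (800, 9), (900, 8)]))
# (900, 8) is dominated by (800, 9): more expensive and lower quality.
```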
I generally haven't written much about software architecture. People make heuristics into religion. But here is something I've thought about: how to build self-correction into systems. This has been something vaguely sitting in my head, lacking a clear conceptual definition, until a whole slew of things popped up today that all had the exact...
I joined Spotify in 2008 to focus on machine learning and music recommendations. It's easy to forget, but Spotify's key differentiator back then was the low-latency playback. People would say that it felt like they had the music on their own hard drive.
Why does it suck to wait for things? In a previous post I analyzed a NYC subway dataset and found that at some point, quite early, it's worth just giving up. This isn't proof that the subway doesn't run on time – in fact it might actually prove that the subway runs really well.
As you may know, one of my (very geeky) interests is approximate nearest neighbor methods, and I'm the author of a Python package called Annoy. I've also built a benchmark suite called ann-benchmarks to compare different packages.
I've been trying to learn Clojure. I keep telling people I meet that I really want to learn Clojure, but still every night I can't get myself to spend time with it. It's unclear if I really want to learn Clojure or just want to have learned Clojure?
(I accidentally published an unfinished draft of this post a few days ago – sorry about that). There's a lot of sources preaching the benefits of dollar cost averaging, or the practice of investing a fixed amount of money regularly.
One of my favorite business hobbies is to reduce some nasty decision down to its absolute core objective, decide the most basic strategy, and then add more and more modifications as you have to confront the complexity of reality (yes I have very lame hobbies thanks I know).
Apparently the MTA (the agency that runs the NYC subway) has a real-time API. My fascination with the subway borders on the obsessive, so obviously I had to analyze some of the data. The documentation is somewhat terrible, but here's some relevant code for how to use the API:
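A minimal sketch of reading the feed (my reconstruction, not the post's actual code; it assumes the gtfs-realtime-bindings package, an MTA API key, and a feed URL that may well have changed since):

```python
# Pull the subway's GTFS-realtime feed and print upcoming arrivals.
# FEED_URL and YOUR_KEY are placeholders; check the MTA developer
# docs for the current endpoint and authentication scheme.
import urllib.request
from google.transit import gtfs_realtime_pb2

FEED_URL = "http://datamine.mta.info/mta_esi.php?key=YOUR_KEY&feed_id=1"

feed = gtfs_realtime_pb2.FeedMessage()
with urllib.request.urlopen(FEED_URL) as response:
    feed.ParseFromString(response.read())

for entity in feed.entity:
    if entity.HasField("trip_update"):
        for update in entity.trip_update.stop_time_update:
            print(entity.trip_update.trip.trip_id,
                  update.stop_id,
                  update.arrival.time)  # POSIX timestamp
```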
I do a lot of recruiting and have given maybe 50 offers in my career. Although many companies do, I never put a deadline on any of them. Unfortunately, I've often ended up competing with other companies who do, and I feel really bad that this usually tricks younger developers into signing offers.
(This is not a very relevant/useful post for regular readers – feel free to skip. I thought I would share it so people can find it on Google.) My blog blew up twice in a week earlier this year when I landed on Hacker News.
Here's a conclusion I've made building consumer products for many years: the speed at which a company innovates is limited by its iteration speed. I don't even mean throughput here. I just mean the cycle time.
I've been spending several hundred bucks renting GPU instances on AWS over the last year. The speedup from a GPU is awesome and hard to deny. GPUs have taken over the field. Maybe following in the footsteps of Bitcoin mining, there's some research on using FPGAs (I know very little about this).
My blog post about fonts generated lots of traffic – it landed on Hacker News, took down my site while I was sleeping, and then obviously vanished from HN before I woke up. But it also got retweeted by a ton of people.
For some reason I decided one night I wanted to get a bunch of fonts. A lot of them. An hour later I had a bunch of scrapy scripts pulling down fonts and a few days later I had more than 50k fonts on my computer.
The easiest way to be a 10x engineer is to make 10 other engineers 2x more efficient. Someone can be a 10x engineer if they do nothing for 364 days and then convince the team to switch to a 2x more productive programming language.
Early last year when I left Spotify I decided to do more reading. I was planning to read at least one book per week and in particular I wanted to brush up on management, economics, and technology.
I've been obsessed with how to iterate quickly based on small scale feedback lately. One awesome website I encountered is Usability Hub which lets you run 5 second tests. Users see your site for 5 seconds and you can ask them free-form questions afterwards.
(Warning: super speculative, feel free to ignore) As Yogi Berra said, “It's tough to make predictions, especially about the future”. Unfortunately predicting is hard, and unsurprisingly people look for the Magic Trick™ that can resolve all the uncertainty.
Curious about Google's newly released TensorFlow? I don't have a beefy GPU machine, so I spent some time getting it to run on EC2. The steps to reproduce it are pretty brutal and I wouldn't recommend going through it unless you want to waste five hours of your life.
I haven't mentioned what I'm currently up to. Earlier this year I left Spotify to join a small startup called Better. We're going after one of the biggest industries in the world that also turns out to be completely broken.
The other day I was looking at marketing spend broken down by channel and wanted to compute some simple uncertainty estimates. I have data like this: Total spend Transactions Channel A 2292.
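One simple way to get an uncertainty estimate on, say, spend per transaction (a sketch of mine; I'm assuming a Poisson model for the transaction counts, which may not be what the post does):

```python
import scipy.stats

def cost_per_transaction_interval(spend, transactions, alpha=0.05):
    # Gamma(k + 1, 1) posterior for the Poisson rate under a flat prior;
    # invert the interval for the rate to get one for spend / rate.
    lo, hi = scipy.stats.gamma.ppf([alpha / 2, 1 - alpha / 2],
                                   transactions + 1)
    return spend / hi, spend / lo

print(cost_per_transaction_interval(spend=2292.0, transactions=35))
```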
I was featured in Peadar Coyle's interview series interviewing various “data scientists” – which is kind of arguable since (a) all the other people in that series are much cooler than me and (b) I'm not really a data scientist.
This is another post based on my talk at NYC Machine Learning. The previous two parts covered most of the interesting parts, but there are still some topics left to be discussed. To go back and read the meaty stuff, check out
This is a blog post rewritten from a presentation at NYC Machine Learning on Sep 17. It covers a library called Annoy that I have built that helps you do nearest neighbor queries in high dimensional spaces.
This is a blog post rewritten from a presentation at NYC Machine Learning last week. It covers a library called Annoy that I have built that helps you do (approximate) nearest neighbor queries in high dimensional spaces.
A couple of people in my old team have been around talking about how Spotify does music recommendations and put together some quite good presentations. First one is Neville Li's presentation about Scala Data Pipelines @ Spotify:
I was playing around with D3 last night and built a silly visualization of antipodes and how our intuitive understanding of the world sometimes doesn't make sense. Check out the visualization at bl.ocks.org! Basically the idea is if you fly from Beijing to Buenos Aires then you can have a layover at any point of the Earth's surface and it won't...
Every once in a while when talking to smart people, the topic of automation comes up. Technology has made lots of occupations redundant (switchboard operators, a long time ago), so what's next? What about software engineers?
Here's a problem that I used to give to candidates. I stopped using it seriously a long time ago since I don't believe in puzzles, but I think it's kind of fun. Let's say you have a function that simulates a random coin flip.
Annoy is a library written by me that supports fast approximate nearest neighbor queries. Say you have a high (1-1000) dimensional space with points in it, and you want to find the nearest neighbors to some point.
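A minimal usage sketch (my illustration; the dimensionality and tree count are arbitrary):

```python
import random
from annoy import AnnoyIndex

f = 40  # dimensionality of the space
index = AnnoyIndex(f, "angular")
for i in range(1000):
    index.add_item(i, [random.gauss(0, 1) for _ in range(f)])

index.build(10)  # 10 trees; more trees give better recall, bigger index
print(index.get_nns_by_item(0, 10))  # the 10 nearest neighbors of item 0
```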
The workflow engine battle has intensified with some more interesting entries lately! Here are a couple I encountered in the last few days. I love that at least two of them are direct references to Luigi!
I have spent some time lately with D3. It's a lot of fun to build interactive graphs. See for instance this demo (will provide a longer writeup soon). D3 doesn't have support for 3D but you can do projections into 2D pretty easily.
Note: this post is full of pseudo-psychology and highly speculative content. Like most fun stuff! I became a manager back in 2009. Being a developer is fun. You have this very tangible way to measure yourself.
Saw this link on Hacker News the other day: The Highway Lane Next to Yours Isn’t Really Moving Any Faster The article describes a phenomenon unique to traffic where cars spread out when they go fast and get more compact when they go slow.
Sometimes you have these awesome insights. A few days ago I got an idea for how to improve index building in Annoy. For anyone who isn't acquainted with Annoy – it's a C++ library with Python bindings that provides fast high-dimensional nearest neighbor search.
Annoy is a C++/Python package I built for fast approximate nearest neighbor search in high dimensional spaces. Spotify uses it a lot to find similar items. First, matrix factorization gives a low dimensional representation of each item (artist/album/track/user) so that every item is a k-dimensional vector, where k is typically 40-100.
I just pinged a few million random IP addresses from my apartment in NYC. Here's the result. Some notes: what's going on with Sweden? Too much torrenting? Ireland is likewise super slow, but not Northern Ireland. Eastern Ukraine is also super slow, maybe not surprising given current events.
There's a bunch of companies working on machine learning as a service. Some old companies like Google, but now also Amazon and Microsoft. Then there's a ton of startups: PredictionIO ($2.7M funding), BigML ($1.6M funding), Clarifai, etc, etc.
As noted by multiple tweets, my previous post describes a phenomenon denoted Berkson's paradox. Here's another example: Why Are Handsome Men Such Jerks?
I saw a bunch of tweets over the weekend about Peter Norvig claiming there's a negative correlation between being good at programming competitions and being good at the job. There were some decent Hacker News comments on it.
Pinterest just open sourced Pinball, which seems like an interesting Luigi alternative. There are two blog posts: Pinball: Building workflow management (from 2014) and Open-sourcing Pinball (from this week). The author has a comment in the comments thread on Hacker News:
Wow I guess it was more than a year ago that I tweeted this. Crazy how time flies by. Anyway, here's my rationale: When I update one line of code I feel like I have to put in a long explanation about its side effects, why it's fully backwards compatible, and why it fixes some issue #xyz.
For most people straight out of school, work life is a bit of a culture shock. For me it was an awesome experience, but a lot of the constraints were different and I had to learn to optimize for different things.
February 6 was my last day at Spotify. In total I spent more than six years at Spotify and it was an amazing experience. I joined Spotify in Stockholm in 2008, mainly because a bunch of friends from programming competitions had joined already.
Chris Johnson’s presentation from Data Day Texas:
I just made it to Sweden suffering from jet lag induced insomnia, but this blog post will not cover that. Instead, I will talk a little bit about technical debt. The concept of technical debt always resonated with me, partly because I always like the analogy with “real” debt.
Just search for “hackers gif”. There you go. Fun for your work emails for the next 500 years. From the awesome movie Hackers. That movie together with The Warriors convinced me that I wanted to live in NYC when I was like… 14 years old.
I was talking with some data engineers at Spotify and had a moment of nostalgia. In 2008, I was writing my master's thesis at Spotify and had to run a Hadoop job to extract some data from the logs.
More Luigi presentations!
At NYC Data Science meetup! Unfortunately the space is full but the talk will be livestreamed – check out the meetup web page for a link tomorrow.
This is the last post about deep learning for chess/go/whatever. But this really cool paper by Christopher Clark and Amos Storkey was forwarded to me by Michael Eickenberg. It's about using convolutional neural networks to play Go.
My previous blog post about deep learning for chess blew up and made it to Hacker News and a couple of other places. One pretty amazing thing was that the Github repo got 150 stars overnight.
I've been meaning to learn Theano for a while and I've also wanted to build a chess AI at some point. So why not combine the two? That's what I thought, and I ended up spending way too much time on it.
Say you build a machine learning model, like a movie recommender system. You need to optimize for something. You have 1-5 stars as ratings so let's optimize for mean squared error. Great. Then let's say you build a new model.
I keep forgetting to buy a costume for Halloween every year, so this year I prepared and got myself a Luigi costume a month in advance. Only to realize I was going to be out of town the whole weekend.
I spent a couple of hours this weekend going through some pull requests and issues to Annoy, which is an open source C++/Python library for Approximate Nearest Neighbor search. I set up Travis-CI integration and spent some time on one of the issues that multiple people had reported.
I'm at RecSys 2014, meeting a lot of people and hanging out at talks. Some of the discussions here were about the filter bubble, which prompted me to formalize my own thoughts. I firmly believe that it's the role of a system to respect the user's intent.
Note: This is a silly application. Don't take anything seriously. Benford's law describes a phenomenon where numbers in any data series will exhibit patterns in their first digit. For instance, if you took a list of the 1,000 longest rivers of Mongolia, or the average daily calorie consumption of mammals, or the wealth distribution of German...
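For reference (this is the standard statement of the law, not something specific to the post): the leading digit $$ d \in \{1, \ldots, 9\} $$ appears with probability $$ P(d) = \log_{10}\left(1 + \frac{1}{d}\right) $$, so a leading 1 shows up about 30.1% of the time while a leading 9 shows up only about 4.6% of the time.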
Inspired by Sander Dieleman's internship at Spotify, I've been playing around with deep learning using Theano. Theano is this Python package that lets you define symbolic expressions (cool), does automatic differentiation (really cool), and compiles it down into bytecode to run on a CPU/GPU (super cool).
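To make that concrete, here is a tiny Theano sketch (my illustration, not from the post): define a symbolic expression, differentiate it automatically, and compile it into a callable function.

```python
import theano
import theano.tensor as T

x = T.dscalar("x")                 # symbolic scalar
y = x ** 2 + 3 * x                 # symbolic expression (cool)
dy = T.grad(y, x)                  # automatic differentiation (really cool)

f = theano.function([x], [y, dy])  # compiled down to fast code (super cool)
print(f(2.0))                      # [array(10.0), array(7.0)]
```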
Many years ago, I used to think that A/B tests were foolproof and all you need to do is compare the metrics for the two groups. The group with the highest conversion rate wins, right?
I've been spending quite some time lately playing around with RNNs for collaborative filtering. RNNs are models that predict a sequence of something. The beauty is that this something can be anything really – as long as you can design an output gate with a proper loss function, you can model essentially anything.
One obvious thing to anyone living in NYC is how tourists cluster in certain areas. I was curious about the larger patterns around this, so I spent some time looking at data. The thing I wanted to understand is: what areas are dominated by tourists?
During my time at Spotify, I've reviewed thousands of resumes and interviewed hundreds of people. Lots of them were rejected but lots of them also got offers. Finally, I've also had my share of offers rejected by the candidate.
From my presentation at MLConf, one of the points I think is worth stressing again is how extremely well combining different algorithms works. In this case, we're training machine learning algorithms on different data sets (playlists, play counts, sessions) and different objectives (least squares, max likelihood).
Just spent a day at MLConf where I was talking about how we do music recommendations. There was a whole range of great speakers (actually almost 2/3 women which was pretty cool in itself). Here are my slides:
Scrolling through the Discover page on Spotify the other day, it occurred to me that the album cover is in fact a fairly strong visual proxy for what kind of content you can expect from an album. I started wondering if the cover can in fact be used for recommendations.
In case you missed it, we just acquired a company called Echo Nest in Boston. These people have been obsessed with understanding music for the past 8 years since it was founded by Brian Whitman and Tristan Jehan out of MIT Medialab.
So Luigi, our open sourced workflow engine in Python, just recently passed 1,000 stars on Github, then shortly after passed mrjob as (I think) the most popular Python package to do Hadoop stuff. This is exciting!
Haven't posted anything in ages, so here's a quick hack I threw together in Python on a Sunday night. Basically I wanted to know whether momentum strategies work well for international stock indexes. I spent a bit of time putting together a strategy that buys the stock index if the return during the previous n days was positive, otherwise doesn't...
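A sketch of that rule in pandas (my reconstruction, not the post's actual code; the window length n is arbitrary):

```python
import pandas as pd

def momentum_strategy_returns(prices: pd.Series, n: int = 90) -> pd.Series:
    daily = prices.pct_change()
    trailing = prices.pct_change(n)  # return over the previous n days
    # Trade on yesterday's signal to avoid lookahead bias.
    signal = (trailing > 0).shift(1, fill_value=False)
    return daily.where(signal, 0.0)  # in cash when the signal is False
```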
We run a ton of A/B tests at Spotify and we look at a ton of metrics. Defining metrics is a little bit of an art form. Ideally you want to define success metrics before you run a test to avoid cherry picking metrics.
Radim Rehurek has put together an excellent summary of approximate nearest neighbor libraries in Python. This is exciting, because one of the libraries he's covering, annoy, was built by me. After introducing the problem, he goes through the list of contestants and sticks with five remaining ones.
I wanted to share some more insight into the algorithms we use at Spotify. One matrix factorization algorithm we have used for a while assumes that we have user vectors $$ \mathbf{a}_u $$ and item vectors $$ \mathbf{b}_i $$.
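In the most common setup (the generic formulation, not necessarily the exact model from the post), the predicted affinity of user $$ u $$ for item $$ i $$ is the inner product $$ \hat{p}_{ui} = \mathbf{a}_u^T \mathbf{b}_i $$, and the vectors are learned by minimizing some loss over observed streams.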
I think it's funny how MS at some point realized they are not the cool kids and there's no reason to appeal to that target audience. Their new marketing strategy finally admits what's been long known: the correlation between “business casual” and using Microsoft products:
One thing I encountered today was a trick using bagging as a way to go beyond a point estimate and get an approximation for the full distribution. This can then be used to penalize predictions with larger uncertainty, which helps reducing false positives.
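A minimal sketch of the trick (my illustration; the model and parameters are arbitrary): fit the same model on bootstrap resamples and use the spread of the per-model predictions as a per-point uncertainty estimate.

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor

def bagged_prediction_distribution(X, y, X_new, n_models=100, seed=0):
    rng = np.random.default_rng(seed)
    preds = []
    for _ in range(n_models):
        idx = rng.integers(0, len(X), size=len(X))  # bootstrap resample
        model = DecisionTreeRegressor().fit(X[idx], y[idx])
        preds.append(model.predict(X_new))
    preds = np.array(preds)  # shape: (n_models, len(X_new))
    # Mean is the point estimate; std approximates the uncertainty.
    return preds.mean(axis=0), preds.std(axis=0)
```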
A lot of people have asked me what models we use for recommendations at Spotify so I wanted to share some insights. Here are benchmarks for some models. Note that we don't use all of them in production.
Btw I just put something up online that I spent a couple of evenings in my couch putting together: it's a website where you can track any numerical data on the web. Want to know how many Twitter followers you have?
A lot of people these days know about collaborative filtering. It's that Netflix Prize thing, right? People rate things 1-5 stars and then you have to predict missing ratings. While there's no doubt that the Netflix Prize was successful, I think it created an illusion that all recommender systems care about explicit 1-5 ratings and RMSE as the...
If you have a few minutes, you should check out mine and Chris Johnson’s panel proposal. Go here and vote: http://panelpicker.sxsw.com/vote/24504 Algorithmic Music Discovery at Spotify: Spotify crunches hundreds of billions of streams to analyze users' music taste and provide music recommendations for its users.
I just answered a Quora question about what, if any, are the differences in the algorithms behind recommendations for music and movies. Of course, every media type is different. For instance, there are fundamental reasons why latent factor models work really well for music and movies, as opposed to location recommendations where I suspect...
Andy Sloane decided to call my 2D visualization and raise it to 3D. (Looks a little weird in the iframe but check out the link). It's based on a LDA model with 200 topics, so the artists tend to stick to clusters where each cluster is a topic.
I'm at KDD in Chicago for a few days. We have a Spotify booth tomorrow, and I wanted to put together some cool graphics to show. I've been thinking about doing a 2D embedding of the top artists ever since I read about t-SNE and other papers, so this was a perfect opportunity to spend some time on it.
I've turned into a lazy bastard and I'm just posting presentations on this blog, but here's one from Rohan Singh at Spotify talking about the backend infrastructure of the Discover page.
I was just at the NYC Predictive Analytics meetup talking about how we build machine learning algorithms using Hadoop to power music recommendations. Great meetup, where we had two speakers, me and Blake Shaw from Foursquare.
I thought this article about the company culture at HubSpot is kind of funny. “HubSpot's Awesome Presentation Shows how to Create a 21st Century Culture”. Just FYI: You're not different. You're a bunch of white hipsters aged 25-30 dressed up in the same theme.
I was in Portland, OR for a few days hanging out at OSCON. Was fun. I also talked a bit about Luigi: Next week I'm presenting at the NYC Predictive Analytics meetup together with Blake Shaw from Foursquare.
Sometimes you have to maximize some function $$ f(w_1, w_2, \ldots, w_n) $$ where $$ w_1 + w_2 + \ldots + w_n = 1 $$ and $$ 0 \le w_i \le 1 $$. Usually, $$ f $$ is concave and differentiable, so there's one unique global maximum and you can solve it by applying gradient ascent.
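Here's a sketch of one standard way to handle the constraint (my illustration, not necessarily the post's method): take a gradient step, then project back onto the probability simplex.

```python
import numpy as np

def project_to_simplex(v):
    # Standard O(n log n) Euclidean projection onto {w : sum(w) = 1, w >= 0}.
    u = np.sort(v)[::-1]
    css = np.cumsum(u)
    idx = np.arange(1, len(v) + 1)
    rho = np.nonzero(u + (1 - css) / idx > 0)[0][-1]
    theta = (1 - css[rho]) / (rho + 1)
    return np.maximum(v + theta, 0)

def maximize_on_simplex(grad_f, n, steps=1000, lr=0.01):
    # Projected gradient ascent for a concave, differentiable f.
    w = np.full(n, 1.0 / n)
    for _ in range(steps):
        w = project_to_simplex(w + lr * grad_f(w))
    return w
```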
Continuing in the same spirit of shameless self-promotion, here's some recent Luigi press: a Reddit thread, A Guide to Python Frameworks for Hadoop (slides from the NYC Hadoop User Group), and a presentation from the Open Analytics NYC meetup about how Foursquare uses Luigi. Luigi is also in the middle of a pretty massive refactoring of the visualizer.
Just open sourced hdfs2cass which is a Hadoop job (written in Java) to do efficient Cassandra bulkloading. The nice thing is that it queries Cassandra for its topology and uses that to partition the data so that each reducer can upload data directly to a Cassandra node.
We had an unconference at Spotify last Thursday and I added a semi-trolling semi-serious topic about abolishing documentation. Or NoDoc, as I'm going to call this movement. This was meant to be mostly a thought experiment, but I don't see it as complete madness.
I've been obsessed with Wikipedia for the past ten years. Occasionally I find some good articles worth sharing and that's why I created the wikiphilia Twitter handle. Just a long stream of stuff that for one reason or another may be interesting.
The Discovery page, the new start page in Spotify, is finally out to a fairly significant percentage of all users. Really happy since we have worked on it for the past six months. Here's a screen shot:
I was browsing around on the Internet and the physics geek in me started reading about Fermat's principle. And suddenly something came back to me that I've been trying to suppress for many years – how I never understood why there's anything fundamental about the principle of least time.
Just promoting Spotify stuff here: check out the Snakebite repo on Github, written by Wouter de Bie. It's a super fast tool to access HDFS over CLI/Python, by accessing the namenode directly over sockets/protobuf. Spotify's developer blog features a nice blog post outlining what it's useful for.
The simple way to get featured on big data blogs these days seems to be: (1) build something that does one thing super well but nothing else, (2) benchmark it against Hadoop, (3) publish stats showing that it's 100x faster than Hadoop, (4) $$$. Spark claims they're 100x faster than Hadoop and there are a lot of stats showing Redshift is 10x faster than Hadoop.
I picked up an issue of Foreign Affairs while flying back to NYC from SFO. It features this long interview with U.S. General Stanley McChrystal and I thought it was pretty interesting how striking some of the similarities are between fighting in a war and developing software.
Annoy is a simple package to find approximate nearest neighbors (ANN) that I just put on Github. I'm not trying to compete with existing packages, but Annoy has a couple of features that makes it pretty useful.
Elias Freider just talked about Luigi at PyData 2013. The presentation above is much better than the one I put together a few weeks ago. In case anyone is interested, I'll include it too:
I recently came across this paper describing how they do ML at Twitter. TL;DR: their approach is pretty interesting. Everything is a Pig workflow, and all the ML logic is implemented as UDFs.
This article from today in Mashable describes some of the fun stuff I get to work with: Erik Bernhardsson is technical lead at Spotify, where he helped to build a music recommendation system based on large-scale machine learning algorithms, mainly matrix factorization of big matrices using Hadoop.
Slides from the talk, slightly edited because (a) some of the slides make little sense taken out of context and (b) Slideshare seems to have problems converting some of the stuff. Collaborative filtering at Spotify from Erik Bernhardsson
From the NYC Machine Learning talk I gave last week: I haven't looked at it yet except briefly. Unfortunately the quality isn't the best.
The Economist just published an article called The best, the worst and the ugly. By looking at historical performance for mutual funds, they find strong support for momentum and mean reversion. Picking the best or the worst fund over the previous five years gives great returns over the next five years.
This was posted on the Twitter Engineering blog a few days ago: Dimension Independent Similarity Computation (DISCO) I just glanced at the paper, and there's some cool stuff going on from a theoretical perspective. What I'm curious about is why they didn't decide to use dimensionality reduction to solve such a big problem.
Not sure how I managed to miss this, but I'm watching this Tumblr presentation and they talk about their projects named after Arrested Development topics: Gob, Parmesan, Buster, Jetpants, Oscar, George and Motherboy. Still, the best software project name is probably Apple's BHA.
Something that pops up pretty frequently is to implement time decay, especially where you have recursive chains of jobs. For instance, say you want to keep track of a popularity score. You calculate today's output by reading yesterday's output, discounting it by $$ \exp(-\lambda \Delta T) $$ and then adding some hit count for today.
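As a tiny sketch (mine, not the post's code), the recursion is just:

```python
import math

def decayed_score(yesterday_score, hits_today, lam, dt_days=1.0):
    # Discount yesterday's score by exp(-lambda * dt), add today's hits.
    return yesterday_score * math.exp(-lam * dt_days) + hits_today
```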
I'm shamelessly promoting my first major open source project. Luigi is a Python module that helps you build complex pipelines of batch jobs, handle dependency resolution, and create visualizations to help manage multiple workflows. It also comes with Hadoop support built in (because that's really where its strength becomes clear).
These are some blog posts which have gotten a disproportionate amount of traffic (10,000+ page views):
2022: We are still early with the cloud: why software development is overdue for a change
2021: Storm in the stratosphere: how the cloud will be reshuffled; Building a data team at a mid-stage startup: a short story; Software infrastructure 2.0…
I like building software, and below are some open source projects I've built during my time at Better or Spotify, along with some things I built in my spare time. Most of it is not on my personal Github, so I've compiled it here. Annoy: a C++/Python library to index and retrieve vectors in...
Introduction I am currently working on various startup ideas. Until recently, I was the Chief Technology Officer at Better where I managed a team of about 300 engineers. In the last 10+ years, I have focused on engineering management, recruiting, building consumer technology, machine learning, and math. My experience goes back to 1999, which was my first professional gig as a software...
Contact information 20 West Street, #23DE, New York NY 10014 917-940-8790 Swedish citizen, green card holder Introduction I am currently working on various startup ideas. Until recently, I was the Chief Technology Officer at Better where I managed a team of about 300 engineers. In the last 10+ years, I have focused on engineering...
I live in NYC and I am the founder and CEO of Modal Labs which is exploring ideas related to data and infrastructure. From Feb 2015 to Jan 2021, I ran the (300-person) tech team at Better.com – a company rethinking how mortgages are done. Before Better, I was at Spotify for 6 years. I spent 2.5 years in Stockholm...