Named Entity Resolution with dslim/distilbert-NER
Related
More from Simon Willison TIL
I wanted to know how large the deepseek-ai/DeepSeek-V3-Base repo on Hugging Face was without actually downloading all of the files. With some help from Claude, here's the recipe that worked. First, clone the repo without having Git LFS download the files: GIT_LFS_SKIP_SMUDGE=1 git clone https://huggingface.co/deepseek-ai/DeepSeek-V3-Base cd...
I was getting the following warning for one of my Python test suites: DeprecationWarning: datetime.datetime.utcnow() is deprecated and scheduled for removal in a future version. Use timezone-aware objects to represent datetimes in UTC: datetime.datetime.now(datetime.UTC). I also saw a deprecation warning elsewhere for my usage of...
Here's what I learned about publishing a single file JavaScript package to NPM for my Prompts.js project. The code is in simonw/prompts-js on GitHub. The NPM package is prompts-js. A simple single file client-side package For this project, I wanted to create an old-fashioned JavaScript file that you could include in a web page using a tag. No...
My tools.simonwillison.net site is a growing collection of small HTML and JavaScript applications hosted as static files on GitHub Pages. Many of those tools take advantage of external APIs such as those provided by OpenAI and Anthropic and Google Gemini, thanks to the increasingly common access-control-allow-origin: * CORS header. I want to...
I really like Cog (previously) as a tool for automating aspects of my Python project documentation - things like the SQL schemas shown on the LLM logging page. When using cog in this way it's important to remember to run cog -r to update those generated files before pushing a commit. I've previously been enforcing this using GitHub Actions - as...