Tweet

https://twitter.com/acm_ccs/status/1460955817516027905

https://twitter.com/djm_/status/1461086963616862212

This Thread may be Removed Anytime!

Twitter may remove this content at anytime! Save it as PDF for later use!

More from @simonw

Simon Willison

@simonw

18 Nov

https://twitter.com/elanazak/status/1460978501331337216

I've had a bit of a breakthrough with this over the past couple of years: maintaining detailed progress notes in a GitHub issues comment thread has dropped my "getting back on track" time down to a fraction of what it was

https://twitter.com/elanazak/status/1460978501331337216

The reason it takes 25 minutes to spin back up again is that you're holding a ton of stuff exclusively on your own memory - so write it down!

Something I've realized is that 90% of software engineering is research, not typing code - figuring out what the code needs to do, which APIs to use, how best to test it etc

So all of that research goes in issue comments. Here's my best recent example: github.com/simonw/s3-cred…

Read 5 tweets

Simon Willison

@simonw

17 Nov

One of the biggest productivity tricks I'm using in the Datasette ecosystem is continuous deployment of live demos - every time I push to Datasette (+ a few other repos) it deploys a demo of latest main - it's fantastic for both catching bugs and linking to from issue comments

I've been working on the datasette-graphql plugin today and the live demo at datasette-graphql-demo.datasette.io/graphql helped me catch a bug where JS files were loading in the wrong order, breaking things - a problem that didn't occur on my laptop

Here's the issue thread for that github.com/simonw/dataset…

Read 4 tweets

Simon Willison

@simonw

13 Nov

Datasette is four years old today! It's come a long way since that first release back in 2017 simonwillison.net/2017/Nov/13/da…

Progress on the project is pretty thoroughly documented by the datasette tag on my blog - 249 items and counting simonwillison.net/tags/datasette/

Something that really excites me about Datasette is that I can genuinely see myself still being thrilled to work on it ten years from now.

Projects like that are pretty rare - I feel very lucky to have found one that combines so many of my interests all in one package

Read 5 tweets

Simon Willison

@simonw

12 Nov

@datasetteproj

I presented a 90 minute workshop on Git scraping and how to explore the resulting data using @datasetteproj at #CodaBr21 (by @EscolaDeDados) today

Here are the exercises from the workshop, plus additional screenshots and notes that I added afterwards

docs.google.com/document/d/1TC…

@datasetteproj

@datasetteproj @EscolaDeDados To save attendees from having to get a working Python environment setup on their laptop, I instead encouraged them to use a free @gitpod account (gitpod.io) - I demonstrated each exercise in GitPod too

Cloud-based development environments are SO GOOD for tutorials

(I had planned to use GitHub Codespaces for this, but then realized that those are not freely available to non-paid users outside of the beta program yet)

Read 6 tweets

Simon Willison

@simonw

21 Aug

Here's a fun challenge: given an array of datetimes, what's the best way to plot those on a frequency graph over time?

They might all be on the same day, or they might be spread out over several years - so the challenge is automatically picking the most interesting bucket size

Looks like d3.bin().thresholds() is the answer I'm looking for observablehq.com/@d3/histogram

Yup, this works perfectly: observablehq.com/@simonw/my-twe…

Read 6 tweets

Simon Willison

@simonw

9 Aug

Is there a reliable way to tell search engine crawlers that a site hasn't been updated in X days so they don't need to re-crawl it?

Do they tend to believe the <lastmod> element in sitemap.xml ? And can I set that to apply to the whole site, not just an individual page?

https://twitter.com/simonw/status/1424801199753089035

Asking because tailing logs shows a vast amount of crawler traffic to Datasette instances that haven't seen any data changes in over a year - I may have to robots.txt block crawlers from them to save in costs, but I'd rather tell them "no point in crawling, nothing has changed"

https://twitter.com/simonw/status/1424801199753089035

Datasette currently has a plugin for configuring robots.txt, but I'm beginning to think it should be part of core and crawlers should be blocked by default - having people explicitly opt-in to having their sites crawled and indexed feels a lot safer datasette.io/plugins/datase…

Read 4 tweets

Share this page!

Simon Willison

Try unrolling a thread yourself!

More from @simonw

Simon Willison

Simon Willison

Simon Willison

Simon Willison

Simon Willison

Simon Willison

Did Thread Reader help you today?

Like this author's thread?