Tweet

This Thread may be Removed Anytime!

Twitter may remove this content at anytime! Save it as PDF for later use!

More from @mdavidallen

𝔻𝕒𝕧𝕚𝕕 𝔸𝕝𝕝𝕖𝕟

@mdavidallen

20 Oct

The cat, as a metaphor for software engineering

Sub-systems -- and "System of Systems"

Abstraction

Read 7 tweets

𝔻𝕒𝕧𝕚𝕕 𝔸𝕝𝕝𝕖𝕟

@mdavidallen

18 Oct

For some reason, some mutuals got me thinking about smoking

Memory: working as a teen, I knew a guy who was 100% sure that it was the glue in cigarette papers that caused cancer, not the tobacco. He smoked natural tobacco leaf-wrapped cigars, thought he was all good

at the shop where I worked, there was a 90-yr old guy who walked with a cane. Was the spitting image of William S. Burroughs. Every day, he came in and got his usual, 2 packs of Lucky Strikes unfiltered

when you're a teenage smoker yourself, a 90-yr old with a 2-pack a day Lucky Strikes habit is like a hero, but even then I knew he was more lucky than "right"

Read 8 tweets

𝔻𝕒𝕧𝕚𝕕 𝔸𝕝𝕝𝕖𝕟

@mdavidallen

14 Sep

@TheHackersNews

Example thread on how to load #JSON into #Neo4j Aura -- working up from simple to more complex. Let's use the .@TheHackersNews public API to load a mini-feed of stories.

First: head endpoint with best stories, and simplest JSON load:

the apoc.load.json call always returns "value" with whatever came back. HackerNews is sending results, an array of post IDs.

We can extract out just the post IDs with a bit of extra cypher like this. Nice clean array of long values.

One step further; now we will UNWIND the array, turning the nested array into each individual item, and then build the URL we'll ask of HackerNews to get the detail of each story. This is how we build URLs one by one; we just take the story ID and concat it into a string URL

Read 9 tweets

𝔻𝕒𝕧𝕚𝕕 𝔸𝕝𝕝𝕖𝕟

@mdavidallen

26 Jul

Real developer problem that gets solved by #graph. Today's thread: JSON document nesting

🧵

so it's common to have data elements like a "food" with some related information (here, it's "nutrition information"). When using a document database you basically have two choices:

- Store the two documents separately and lookup the other when needed
- Store them together

Storing them together is great for efficient read operations. But it's bad for updates. If many foods share the same nutrition facts, and you need to update the facts, you have many updates to do to change one simple thing

because you duplicated data in sub-documents

Read 9 tweets

𝔻𝕒𝕧𝕚𝕕 𝔸𝕝𝕝𝕖𝕟

@mdavidallen

19 May

A function is a DB that maps a key/input set to a value/result that's why they memoize so well

A DB is an impure function that returns a value given a particular input/query

GitHub is a database of programs

And data.gov is a program that returns DBs

Streams and tables are kinda the same thing looked at through different lenses

🤯

docs.confluent.io/platform/curre…

Tables and graphs are kinda the same thing looked at through different lenses

🤯

Read 5 tweets

𝔻𝕒𝕧𝕚𝕕 𝔸𝕝𝕝𝕖𝕟

@mdavidallen

19 May

@neo4j

Batch vs. streaming data ingest into #graph and .@neo4j

(mini thread)

So the main typical tradeoff is latency. Batch when you need fresh data in larger volumes, say once per hour/day/week/month

Stream when time value of data is high/immediate and you can't afford to be more than minutes behind

The overall event queue (so to speak) that's being ingested has a total velocity. Let's say it's

- 1M events/day
- ~42k events/hour
- ~694 events/min
- ~69 events/sec

Let's say 2kb per event, or roughly 2gb/day, 138kb/sec.

Read 20 tweets

Share this page!

𝔻𝕒𝕧𝕚𝕕 𝔸𝕝𝕝𝕖𝕟

Try unrolling a thread yourself!

More from @mdavidallen

𝔻𝕒𝕧𝕚𝕕 𝔸𝕝𝕝𝕖𝕟

𝔻𝕒𝕧𝕚𝕕 𝔸𝕝𝕝𝕖𝕟

𝔻𝕒𝕧𝕚𝕕 𝔸𝕝𝕝𝕖𝕟

𝔻𝕒𝕧𝕚𝕕 𝔸𝕝𝕝𝕖𝕟

𝔻𝕒𝕧𝕚𝕕 𝔸𝕝𝕝𝕖𝕟

𝔻𝕒𝕧𝕚𝕕 𝔸𝕝𝕝𝕖𝕟

Did Thread Reader help you today?

Like this author's thread?