Chris Seymour at $ONT.L Oxford @nanopore tweets about Apple Silicon's AMX instruction set. A mysterious set of instructions now beginning to be mapped out by a group of volunteer developers. So what does Apple Silicon mean for #Bioinformatics?
As many will know, $AAPL Apple came up with their own silicon a few years ago, a branch off the @ARM 64-bit designs, of which they've been diverging ever since. The Apple M1 and M2 cores have been thoroughly reviewed in social media, but less from the optics of #Bioinformatics.
First of all, what reason did @Apple have to design their own silicon? Could they not just have adopted an @ARM design and stick to it? Well, the history of the company is full of examples of Apple wanting to do "their own thing" rather than integrating someone else's silicon...
... in their own products. But this new endeavour of the Apple Silicon line of products is the boldest mode for Apple as a company: have their own silicon designs, book large chunks of TSMCs fab capacity to produce them, integrate the silicon tightly with their own software ...
... stack to fully optimize hardware and software together. And this software stack is what developers will use to compile their software against, including #Bioinformatics software developers.
The basic components of an Apple Silicon M1/M2 chip are (1) the standard ARMv8 capabilities, including the #SIMD#NEON vector instructions, plus (2) Apple's undocumented AMX instructions, then (3) The Neural Engine called ANE or NPU, and (4) The GPU (Metal Compute Shaders).
Because Apple cares about their own software stack, which Apple employees develop themselves, most of the blueprints on how to access the silicon are not well documented, or not at all in the public domain. That's the case for Apple's AMX1/AMX2 instructions.
So developers out there, including #Bioinformatics developers can decide to do this heavy lifting themselves either by (a) adopting the libraries that Apple themselves supports, or (b) use something like Rosetta2 or (c) wait for the OSS community to reverse engineer ...
... the instructions sets, implement them over the OSS software stack (Linux, LLVM, etc.) and bring the available OSS software stack to parity for Apple Silicon compared to where it currently is. Each approach has its pros and cons.
The Rosetta2 approach is the easiest, but also the least performant. Whereas relying on Apple's software stack can get tricky if there are unsupported layers, it can bring a performance boost compared to the Rosetta2 approach, e.g. in #Bioinformatics software.
• • •
Missing some Tweet in this thread? You can try to
force a refresh
On library prep at #Nanoporeconf, a description for PCR-free methods showing the difference between ligation (max output) and rapid mode (10minutes, minimal lab equipment needed). Ultralong reads (ULR) also enabled, all Kit14.
Rapid ULR. Current record is about 4 megabases.
PCR expansion kits enable the use of samples with low input amount.
I did a deep dive on the different workflow management (WFM) tools for #Bioinformatics Data Analysis a few years ago, and since then there have been a few extra entrants in this segment, still mostly concentrated in serving the Next-Generation Sequencing field.
A few years ago, there were two communities dominating the open-source WFM ecosystem in NextFlow and SnakeMake, and two platforms dominating the the commercial offerings in DNAnexus and Illumina BaseSpace.
Since then, a company out of the founders of Nextflow has started offering enterprise support for Nextflow workflows in the cloud: Seqera Labs. They offer the extra level of support that some organizations require to run Nextflow on their Data Analysis setups.
More interesting Next-Generation Sequencing knowledge in the ASeq Discord channel (by @new299). Illumina patterned flowcells and the etching process to "print" the wells into the flowcell. Could be down to 350nm diameter for some flowcell configurations now.
If I remember correctly, Illumina started with a 600nm diameter for the patterned flowcell, in the HiSeq X and then later on in the evolution of the platform that used these patterned flowcells.
They then said to have gone down to 500nm, and what you are showing seems to indicate that it's at 350nm now, at least for the NextSeq 2000? I am not sure if they claimed that for NovaSeq X?
There have been some acquisitions in #CancerDiagnostics and #CancerScreening recently, some of which signify a trend towards consolidation that is worth describing:
$A Agilent is moving towards some more vertical integration in Cancer Dx and Cancer screening
by recently acquiring both announcing a partnership with Akoya Bio and announcing the acquisition of Avida Biomed.
Some may ask: isn’t $A Agilent too small to go into this field? Would they be able to compete against $ILMN Illumina/GrailBio or $GH Guardant Health or $EXAS Exact Sciences?
It is likely that as Spatial Biology tools become more robust and user-friendly, they will become increasingly popular and widely adopted in the scientific community.
This may lead to a shift in the balance between single-cell and Spatial Biology approaches, with the latter eventually becoming more prevalent.
Additionally, as more and more datasets are generated using Spatial Biology techniques, the field of Machine Learning and Artificial Intelligence will likely play an increasingly important role in analysing and interpreting this data.