glomelurus.com: A Geek's Story: Thesis

Thesis

This thesis program is the result of my work from Dec 2014 until around Mar 2015. It is a parallelisation of swarm - an existing single-linkage clustering algorithm for metagenomics data. This program arranges the complete hierarchical clustering workflow into of a set of finely-grained computations (referred as row calculations) that can be run in parallel. The concept of "subseed" was re-used from swarm to skip most of the redundant comparisons (referred as economic search).

The code for global pairwise alignment and k-mer filtering scheme were also re-used from Swarm since they were proven to be very efficient with the SIMD utilisation.

Based on the tests that were run on medium sequence length dataset (averagely 381 nucleotides), the speedup that can be achieved through the parallelisation is averagely 11 times when using 16 threads. This result indicates a successful parallelisation technic which according to Amdahl's law it means 97% of the code were able to be parallelised.

4 level economic search

sequence distance triangle inequality

And the full document:

5 comments: (+add yours?)

Unknown said...: September 11, 2015 at 7:03 PM; This comment has been removed by a blog administrator.
Wright Petter said...: April 13, 2018 at 12:46 PM; Hi, Great.. Tutorial is just awesome..It is really helpful for a newbie like me.. I am a regular follower of your blog. Really very informative post you shared here.
Kindly keep blogging. If anyone wants to become a Front end developer learn from Javascript Online Training from India . or learn thru JavaScript Online Training from India. Nowadays JavaScript has tons of job opportunities on various vertical industry.
ES6 Training in Chennai
Women Seeking Men Indio said...: February 4, 2025 at 4:32 PM; This is an impressive demonstration of how parallelism can significantly improve performance in metagenomics data analysis.
Tamizh said...: April 10, 2026 at 6:29 PM; I like how simple this content is.
It makes learning easier.
Check this Social Media Marketing Course.
Gokul said...: April 21, 2026 at 7:59 PM; Nice post! Consider UI/UX Courses in Chennai.

glomelurus.com: A Geek's Story

Pages

Thesis

5 comments: (+add yours?)

Popular Posts

Tags

Archives

Try Me!!
Enter some numbers here:

glomelurus.com: A Geek's Story

Pages

Thesis

5 comments: (+add yours?)

Popular Posts

Tags

Archives

Try Me!! Enter some numbers here:

Try Me!!
Enter some numbers here: