N. Ahmed | TU Delft Repository

Comparison of cloud-to-cloud distance calculation methods for change detection in spatio-temporal point clouds

The advantages of using point clouds for change detection analysis include comprehensive spatial and temporal representation, as well as high precision and accuracy in the calculations. These benefits make point clouds a powerful data type for spatio-temporal analysis. Neverthele ...

Comparison of Cloud-to-Cloud Distance Calculation Methods

Is the Most Complex Always the Most Suitable?

Cloud-to-cloud (C2C) distance calculations are frequently performed as an initial stage in change detection and spatiotemporal analysis with point clouds. There are various methods for calculating C2C distance, also called inter-point distance, which refers to the distance betwee ...

SALoBa

Maximizing Data Locality and Workload Balance for Fast Sequence Alignment on GPUs

Sequence alignment forms an important backbone in many sequencing applications. A commonly used strategy for sequence alignment is an approximate string matching with a two-dimensional dynamic programming approach. Although some prior work has been conducted on GPU acceleration o ...

GPU acceleration of Darwin read overlapper for de novo assembly of long DNA reads

Journal article (2020) - Nauman Ahmed (author), Tong Dong Qiu (author), Koen Bertels (author), Zaid Al-Ars (author)

BACKGROUND: In Overlap-Layout-Consensus (OLC) based de novo assembly, all reads must be compared with every other read to find overlaps. This makes the process rather slow and limits the practicality of using de novo assembly methods at a large scale in the field. Darwin is a fas ...

High Performance Seed-and-Extend Algorithms for Genomics

Doctoral thesis (2020) - Nauman Ahmed (author)

Recent advances in DNA sequencing technology have opened new doors for scientists to use genomic data analysis in a variety of applications that directly affect human lives. However, the analysis of unprecedented volumes of sequencing data being produced represents a formidable c ...

Optimizing performance of GATK workflows using Apache Arrow In-Memory data framework

Background: Immense improvements in sequencing technologies enable the production of large amounts of high-throughput, cost-effective next-generation sequencing (NGS) data. This data needs to be processed efficiently for further downstream analyses. Computing systems need these large amounts of data close to the processor (with low latency) for fast and efficient processing. However, existing workflows depend heavily on disk storage and access, so processing this data incurs huge disk I/O overheads. Previously, due to the cost, volatility and other physical constraints of DRAM, it was not feasible to place large working data sets in memory. However, recent developments in storage-class memory and non-volatile memory technologies have enabled computing systems to place huge amounts of data in memory and process it directly from there, avoiding disk I/O bottlenecks. To exploit the benefits of such memory systems efficiently, data must be placed in memory in a proper format and accessed at high throughput, avoiding (de)serialization and copy overheads between processes. For this purpose, we use the newly developed Apache Arrow, a cross-language development framework that provides a language-independent columnar in-memory data format for efficient in-memory big data analytics. This allows genomics applications developed in different programming languages to communicate in memory without having to access disk storage, avoiding (de)serialization and copy overheads.
Implementation: We integrate an Apache Arrow based in-memory Sequence Alignment/Map (SAM) format and its shared memory object store library into widely used genomics high-throughput data processing applications such as BWA-MEM, Picard and GATK to allow in-memory communication between these applications. In addition, this allows us to exploit the cache locality of tabular data and parallel processing capabilities through shared memory objects.
Results: Our implementation shows that adopting an in-memory SAM representation in genomics high-throughput data processing applications results in better system resource utilization, fewer memory accesses due to high cache locality exploitation, and parallel scalability due to shared memory objects. Our implementation focuses on the GATK best practices recommended workflows for germline analysis on whole genome sequencing (WGS) and whole exome sequencing (WES) data sets. We compare against a number of existing in-memory data placement and sharing techniques, such as ramDisk and Unix pipes, to show how columnar in-memory data representation outperforms both. We achieve a speedup of 4.85x and 4.76x for WGS and WES data, respectively, in the overall execution time of variant calling workflows. Similarly, a speedup of 1.45x and 1.27x for these data sets, respectively, is achieved compared to the second fastest workflow. In some individual tools, particularly in sorting, duplicate removal and base quality score recalibration, the speedup is even more promising.
Availability: The code and scripts used in our experiments are available in both container and repository form at: https://github.com/abs-tudelft/ArrowSAM.
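
The columnar in-memory idea described in this abstract can be illustrated with a minimal Python sketch. It uses only the standard library and is not the Apache Arrow API or the ArrowSAM implementation; the record fields (`names`, `pos`, `mapq`) are hypothetical, simplified stand-ins for SAM fields. It shows one typed, contiguous buffer per field, a sort that only touches the position column, and a view that shares a column buffer without copying it.

```python
from array import array

# Hypothetical, simplified alignment records in row-oriented form:
# (read name, mapping position, mapping quality).
rows = [
    ("read3", 300, 60),
    ("read1", 100, 42),
    ("read2", 200, 0),
]

# Column-oriented layout: one contiguous, typed buffer per numeric field.
names = [r[0] for r in rows]
pos = array("q", (r[1] for r in rows))   # signed 64-bit integers
mapq = array("B", (r[2] for r in rows))  # unsigned bytes

# Sorting by position only has to scan the `pos` column;
# the other columns are reordered through the resulting index.
order = sorted(range(len(pos)), key=pos.__getitem__)
sorted_names = [names[i] for i in order]
sorted_mapq = [mapq[i] for i in order]

# A memoryview exposes the column's buffer without copying it,
# loosely analogous to sharing a column between consumers.
view = memoryview(pos)
```

Because each numeric field lives in its own buffer, a tool that needs only positions (for example, a sort) reads a compact array instead of walking whole records, which is the cache-locality argument the abstract makes.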

ArrowSAM

In-Memory Genomics Data Processing Using Apache Arrow

Conference paper (2020) - Tanveer Ahmad (author), Nauman Ahmed (author), Johan Peltenburg (author), Zaid Al-Ars (author)

The rapidly growing size of genomics data bases, driven by advances in sequencing technologies, demands fast and cost-effective processing. However, processing this data creates many challenges, particularly in selecting appropriate algorithms and computing platforms. Computing s ...

Efficient GPU Acceleration for Computing Maximal Exact Matches in Long DNA Reads

Conference paper (2020) - Nauman Ahmed (author), Koen Bertels (author), Zaid Al-Ars (author)

The seeding heuristic is widely used in many DNA analysis applications to speed up the analysis time. In many applications, seeding takes a substantial amount of the total execution time. In this paper, we present an efficient GPU implementation for computing maximal exact matchi ...

Correction to

GASAL2: A GPU accelerated sequence alignment library for high-Throughput NGS data (BMC Bioinformatics (2019) 20 (520) DOI: 10.1186/s12859-019-3086-9)

Journal article (2019) - Nauman Ahmed (author), Jonathan Lévy (author), Shanshan Ren (author), Hamid Mushtaq (author), Koen Bertels (author), Zaid Al-Ars (author)

Following publication of the original article [1], the author requested changes to the figures 4, 7, 8, 9, 12 and 14 to align these with the text. The corrected figures are supplied below. The original article [1] has been corrected. [Typesetter, please insert new supplied figure ...

GASAL2

A GPU accelerated sequence alignment library for high-throughput NGS data

Journal article (2019) - Nauman Ahmed (author), Jonathan Lévy (author), Shanshan Ren (author), Hamid Mushtaq (author), Koen Bertels (author), Zaid Al-Ars (author)

BACKGROUND: Due to the computational complexity of sequence alignment algorithms, various accelerated solutions have been proposed to speed up this analysis. NVBIO is the only available GPU library that accelerates sequence alignment of high-throughput NGS data, but has limited perfo ...

SparkGA2

Production-quality memory-efficient Apache Spark based genome analysis framework

Journal article (2019) - Hamid Mushtaq (author), Nauman Ahmed (author), Zaid Al-Ars (author)

Due to the rapid decrease in the cost of NGS (Next Generation Sequencing), interest has increased in using data generated from NGS to diagnose genetic diseases. However, the data generated by NGS technology is usually in the order of hundreds of gigabytes per experiment, thus req ...

GPU Accelerated Sequence Alignment with Trace-back for GATK HaplotypeCaller

Journal article (2019) - Shanshan Ren (author), Nauman Ahmed (author), Koen Bertels (author), Zaid Al-Ars (author)

Background: Pairwise sequence alignment is widely used in many biological tools and applications. Existing GPU accelerated implementations mainly focus on calculating optimal alignment score and omit identifying the optimal alignment itself. In GATK HaplotypeCaller (HC), the semi ...

Diminished-1 Fermat Number Transform for Integer Convolutional Neural Networks

Conference paper (2019) - Baozhou Zhu (author), Nauman Ahmed (author), Johan Peltenburg (author), Koen Bertels (author), Zaid Al-Ars (author)

Convolutional Neural Networks (CNNs) are a class of widely used deep artificial neural networks. However, training large CNNs to produce state-of-the-art results can take a long time. In addition, we need to reduce compute time of the inference stage for trained networks to make ...

An Efficient GPU-based de Bruijn Graph Construction Algorithm for Micro-Assembly

Conference paper (2018) - Shanshan Ren (author), Nauman Ahmed (author), Koen Bertels (author), Zaid Al-Ars (author)

In order to improve the accuracy of indel detection, micro-assembly is used in multiple variant callers, such as the GATK HaplotypeCaller to reassemble reads in a specific region of the genome. Assembly is a computationally intensive process that causes runtime bottlenecks. In th ...

GPU Accelerated API for Alignment of Genomics Sequencing Data

Conference paper (2017) - Nauman Ahmed (author), Hamid Mushtaq (author), Koen Bertels (author), Zaid Al-Ars (author)

Sequence alignment is a core step in the processing of DNA and RNA sequencing data. In this paper, we present a high performance GPU accelerated library (GASAL) for pairwise sequence alignment of DNA and RNA sequences. The GASAL library provides accelerated kernels for local, glo ...

Streaming Distributed DNA Sequence Alignment Using Apache Spark

Conference paper (2017) - Hamid Mushtaq (author), Nauman Ahmed (author), Zaid Al-Ars (author)

The large amount of data generated by Next Generation Sequencing (NGS) technology, usually in the order of hundreds of gigabytes per experiment, has to be analyzed quickly to generate meaningful variant results. The first step in analyzing such data is to map those sequenced r ...

Predictive Genome Analysis Using Partial DNA Sequencing Data

Conference paper (2017) - Nauman Ahmed (author), Koen Bertels (author), Zaid Al-Ars (author)

Much research has been dedicated to reducing the computational time associated with the analysis of genome data, which resulted in shifting the bottleneck from the time needed for the computational analysis part to the actual time needed for sequencing of DNA information. DNA seq ...

A Comparison of Seed-and-Extend Techniques in Modern DNA Read Alignment Algorithms

Conference paper (2016) - Nauman Ahmed (author), Koen Bertels (author), Zaid Al-Ars (author)

DNA read alignment is a major step in genome analysis. However, as DNA reads continue to become longer, new approaches need to be developed to effectively use these longer reads in the alignment process. Modern aligners commonly use a two-step approach for read alignment: 1. seed ...

Heterogeneous hardware/software acceleration of the BWA-MEM DNA alignment algorithm

Conference paper (2016) - Nauman Ahmed (author), Vlad-Mihai Sima (author), Ernst Joachim Houtgast (author), Koen Bertels (author), Zaid Al-Ars (author)

The fast decrease in cost of DNA sequencing has resulted in an enormous growth in available genome data, and hence led to an increasing demand for fast DNA analysis algorithms used for diagnostics of genetic disorders, such as cancer. One of the most computationally intensive ste ...