Constrained Single-Error-Detecting codes for DNA-based Storage Systems

Abstract

The amount of data being produced is growing exponentially. An important
challenge is to find methods to store this data efficiently and in an
environmentally friendly way. One idea that is a growing research topic
involves using synthetic DNA. DNA has the potential to be more efficient and
environmentally friendly than current methods. DNA is made of a sequence of
four nucleotides: Adenine (A), Cytosine (C), Guanine (G), and Thymine (T).
To store data, DNA strands can be created with specific nucleotide sequences.
In the process of reading and storing data, substitution errors can occur. Two
constraints are introduced to minimise the number of errors: the GC-weight
constraint, which states that every DNA sequence must have a fixed number
of G and C nucleotides, and the runlength constraint, which bounds the
maximum number of consecutive identical nucleotides allowed in every DNA
sequence.
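
The two constraints can be illustrated with a minimal sketch. The function names and the parameters weight and max_run are hypothetical, chosen only for illustration; they are not taken from the work itself, which may define or enforce the constraints differently.

```python
def gc_weight(seq: str) -> int:
    """Count the G and C nucleotides in a DNA sequence."""
    return sum(1 for base in seq if base in "GC")


def max_runlength(seq: str) -> int:
    """Return the length of the longest run of identical consecutive nucleotides."""
    longest = 0
    current = 0
    previous = None
    for base in seq:
        # Extend the current run if the nucleotide repeats, otherwise start a new run.
        current = current + 1 if base == previous else 1
        longest = max(longest, current)
        previous = base
    return longest


def satisfies_constraints(seq: str, weight: int, max_run: int) -> bool:
    """Check both constraints; weight and max_run are assumed example parameters."""
    return gc_weight(seq) == weight and max_runlength(seq) <= max_run
```

Under these assumptions, with weight = 2 and max_run = 1, the sequence ACGT satisfies both constraints, while AACG has the required GC-weight but fails the runlength constraint because of the run AA.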