Sign In

Communications of the ACM

ACM TechNews

Finding the Branches on the Tree of Life

View as: Print Mobile App Share:

Said the University of Waterloo's Lila Kari, "To the best of our knowledge, this is the first alignment-free method that uses deep neural networks for unsupervised clustering of unlabelled DNA sequences.”

Credit: University of Waterloo Cheriton School of Computer Science

The Deep Learning for Unsupervised Clustering of DNA Sequences (DeLUCS) technique developed by researchers at Canada's University of Waterloo and Western University draws taxonomic relationships between organisms via unsupervised machine learning.

Waterloo's Lila Kari said DeLUCS determines these relationships "across a range of genetic datasets from organisms as diverse as vertebrates, bacteria, and viruses."

The researchers compared genomes of organisms using frequency chaos game representation (FCGR), a graphical depiction of base sequences in DNA showing how often a particular nucleotide sequence occurs.

The process generates FCGR pairs of sequences and mimics as input to an artificial neural network, "from which it finds patterns that can be used to create clusters," Kari said. "This method has an accuracy of almost 80%, and often much better."

From University of Waterloo Cheriton School of Computer Science (Canada)
View Full Article


Abstracts Copyright © 2022 SmithBucklin, Washington, DC, USA


No entries found

Sign In for Full Access
» Forgot Password? » Create an ACM Web Account