Author
Listed:
- Neelay Junnarkar
(Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, Berkeley, California 94720)
- Can Kızılkale
(Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, Berkeley, California 94720; and Applied Mathematics and Computational Research Division, Berkeley, California 94720)
- Nevena Golubovic
(Department of Computer Science, University of California, Santa Barbara, Santa Barbara, California 93106)
- Murat Arcak
(Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, Berkeley, California 94720)
- Aydın Buluç
(Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, Berkeley, California 94720; and Applied Mathematics and Computational Research Division, Berkeley, California 94720)
Abstract
Applications such as reconstructing cell lineage trees (represented as phylogenetic trees) from single-cell sequencing data require reconstructing a { 0 , 1 } -matrix that has many errors and missing entries. We introduce Sempervirens, a very fast matrix reconstruction algorithm for noisy and incomplete matrix representations of phylogenetic trees. Sempervirens uses an iterative maximum-likelihood approach to determine the topology tree represented by the corrupted data. We show that Sempervirens is at least three orders of magnitude faster than other methods on thousand by thousand matrices, with the speed gap widening with larger matrices. We also show that Sempervirens matches state-of-the-art methods in reconstruction accuracy. The speed of Sempervirens enables it to be tractably applied to reconstructing much larger matrices than those that other methods can reconstruct. In addition to experimental results, we justify the algorithm with a mathematical treatment of its subprocedures.
Suggested Citation
Neelay Junnarkar & Can Kızılkale & Nevena Golubovic & Murat Arcak & Aydın Buluç, 2026.
"Sempervirens: A Fast Reconstruction Algorithm for Noisy and Incomplete Binary Matrix Representations of Trees,"
INFORMS Journal on Computing, INFORMS, vol. 38(2), pages 590-604, March.
Handle:
RePEc:inm:orijoc:v:38:y:2026:i:2:p:590-604
DOI: 10.1287/ijoc.2023.0373
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:orijoc:v:38:y:2026:i:2:p:590-604. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.