Toggle light / dark theme

Topsicle: a method for estimating telomere length from whole genome long-read sequencing data

Long read sequencing technology (advanced by Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (Nanopore)) is revolutionizing the genomics field [43] and it has major potential to be a powerful computational tool for investigating the telomere length variation within populations and between species. Read length from long read sequencing platforms is orders of magnitude longer than short read sequencing platforms (tens of kilobase pairs versus 100–300 bp). These long reads have greatly aided in resolving the complex and highly repetitive regions of the genome [44], and near gapless genome assemblies (also known as telomere-to-telomere assembly) are generated for multiple organisms [45, 46]. The long read sequences can also be used for estimating telomere length, since whole genome sequencing using a long read sequencing platform would contain reads that span the entire telomere and subtelomere region. Computational methods can then be developed to determine the telomere–subtelomere boundary and use it to estimate the telomere length. As an example, telomere-to-telomere assemblies have been used for estimating telomere length by analyzing the sequences at the start and end of the gapless chromosome assembly [47,48,49,50]. But generating gapless genome assemblies is resource intensive and cannot be used for estimating the telomeres of multiple individuals. Alternatively, methods such as TLD [51], Telogator [52], and TeloNum [53] analyze raw long read sequences to estimate telomere lengths. These methods require a known telomere repeat sequence but this can be determined through k-mer based analysis [54]. Specialized methods have also been developed to concentrate long reads originating from chromosome ends. These methods involve attaching sequencing adapters that are complementary to the single-stranded 3′ G-overhang of the telomere, which can subsequently be used for selectively amplifying the chromosome ends for long read sequencing [55,56,57,58]. While these methods can enrich telomeric long reads, they require optimization of the protocol (e.g., designing the adapter sequence to target the G-overhang) and organisms with naturally blunt-ended telomeres [59, 60] would have difficulty implementing the methods.

An explosion of long read sequencing data has been generated for many organisms across the animal and plant kingdom [61, 62]. A computational method that can use this abundant long read sequencing data and estimate telomere length with minimal requirements can be a powerful toolkit for investigating the biology of telomere length variation. But so far, such a method is not available, and implementing one would require addressing two major algorithmic considerations before it can be widely used across many different organisms. The first algorithmic consideration is the ability to analyze the diverse telomere sequence variation across the tree of life. All vertebrates have an identical telomere repeat motif TTAGGG [63] and most previous long read sequencing based computational methods were largely designed for analyzing human genomic datasets where the algorithms are optimized on the TTAGGG telomere motif. But the telomere repeat motif is highly diverse across the animal and plant kingdom [64,65,66,67], and there are even species in fungi and plants that utilize a mix of repeat motifs, resulting in a sequence complex telomere structure [64, 68, 69]. A new computational method would need to accommodate the diverse telomere repeat motifs, especially across the inherently noisy and error-prone long read sequencing data [70]. With recent improvements in sequencing chemistry and technology (HiFi sequencing for PacBio and Q20 + Chemistry kit for Nanopore) error rates have been substantially reduced to 1% [71, 72]. But even with this low error rate, a telomeric region that is several kilobase pairs long can harbor substantial erroneous sequences across the read [73] and hinder the identification of the correct telomere–subtelomere boundary. In addition, long read sequencers are especially error-prone to repetitive homopolymer sequences [74,75,76], and the GT-rich microsatellite telomere sequences are predicted to be an especially erroneous region for long read sequencing. A second algorithmic consideration relates to identifying the telomere–subtelomere boundary. Prior long read sequencing based methods [51, 52] have used sliding windows to calculate summary statistics and a threshold to determine the boundary between the telomere and subtelomere. Sliding window and threshold based analyses are commonly used in genome analysis, but they place the burden on the user to determine the appropriate cutoff, which for telomere length measuring computational methods may differ depending on the sequenced organism. In addition, threshold based sliding window scans can inflate both false positive and false negative results [77,78,79,80,81,82] if the cutoff is improperly determined.

Here, we introduce Topsicle, a computational method that uses a novel strategy to estimate telomere lengths from raw long read sequences from the entire whole genome sequencing library. Methodologically, Topsicle iterates through different substring sizes of the telomere repeat sequence (i.e., telomere k-mer) and different phases of the telomere k-mer are used to summarize the telomere repeat content of each sequencing read. The k-mer based summary statistics of telomere repeats are then used for selecting long reads originating from telomeric regions. Topsicle uses those putative reads from the telomere region to estimate the telomere length by determining the telomere–subtelomere boundary through a binary segmentation change point detection analysis [83]. We demonstrate the high accuracy of Topsicle through simulations and apply our new method on long read sequencing datasets from three evolutionarily diverse plant species (A. thaliana, maize, and Mimulus) and human cancer cell lines. We believe using Topsicle will enable high-resolution explorations of telomere length for more species and achieve a broad understanding of the genetics and evolution underlying telomere length variation.

How a Molecular Motor Minimizes Energy Waste

Turning a biologically important molecular motor at a constant rate saves energy, according to experiments.

Within every biological cell is an enzyme, called adenosine triphosphate (ATP) synthase, that churns out energy-rich molecules for fueling the cell’s activity. New experiments investigate the functioning of this “energy factory” by artificially cranking one of the enzyme’s molecular motors [1]. The results suggest that maintaining a fixed rotation rate minimizes energy waste caused by microscopic fluctuations. Future work could confirm the role of efficiency in the evolutionary design of biological motors.

ATP synthase consists of two rotating molecular motors, Fo and F1, that are oriented along a common rotation axis and locked together so that the rotation of Fo exerts a torque on the shaft in the middle of F1. The resulting motion within F1 helps bring together the chemical ingredients of the molecule ATP, which stores energy that can later be used in cellular processes.

Valproate-vitamin E co-treatment preserved cortico-callosal white matter integrities in cypermethrin co-exposed pentylene tetrazole induced seizure

Epilepsy is characterized by recurrent seizures and neurological consequences, which may be associated with impaired myelin and glial integrity, and exacerbated by environmental neurotoxicants. Environmental neurotoxicants, such as Cypermethrin (CPM), may heighten these impairments, worsening seizure outcomes. This study investigates the effects of Cypermethrin (CPM) on Pentylenetetrazole (PTZ)-induced seizures and the Vitamin E (Vit E) and valproate (VAP) co-interventions on myelin and glial integrity.

Histochemical and immunohistochemical analyses for hematoxylin and eosin (H&E), myelin basic protein (MBP), ionized calcium-binding adaptor molecule 1 (IBA1), glial fibrillary acidic protein (GFAP), and oligodendrocyte transcription factor 2 (OLIG-2) were conducted on cerebral white matter and corpus callosum tissues. The density of stained cells and immunoreactivity obtained with ImageJ was subjected to one-way analysis of variance.

Immunohistochemistry revealed that cypermethrin exposure in PTZ-induced seizure rats led to marked neuronal, oligodendroglial, and myelin loss, accompanied by substantial glial activation in both cerebral white matter and corpus callosum. Interventional ingestions of VAP and Vit E, especially when combined, substantially reduced both microglial activation and reactive astrogliosis, thereby consequently preventing oligodendrocyte and neuronal loss, thus preserving both cerebral white matter and callosal myelin.

Supercomputer modeling unlocks longstanding mystery of subducted oceanic slabs

An international research collaboration has harnessed supercomputing power to better understand how massive slabs of ancient ocean floors are shaped as they sink hundreds of kilometers below Earth’s surface.

Sophisticated computer models developed by researchers in the UK, Switzerland and the U.S. have cast new light on the complex physical interactions which govern the sliding and sinking of the ancient ocean floor, also referred to as subducted slabs, through Earth’s mantle, a process known as subduction.

Researchers from the University of Glasgow led the study. Their paper, “The Role of the Overriding Plate and Mantle Viscosity Structure on Deep Slab Morphology,” is published in Geochemistry, Geophysics, Geosystems.

Spinel-type sulfide semiconductors achieve room-temperature light emission across violet to orange spectrum

A spinel-type sulfide semiconductor that can emit light from violet to orange at room temperature has been developed by researchers at Science Tokyo, overcoming the efficiency limitations of current LED and solar cell materials. The material, (Zn, Mg)Sc2S4, can be chemically tuned to switch between n-type and p-type conduction, leading to future pn homojunction devices. This versatile semiconductor offers a practical path toward the development of more efficient LEDs and solar cells.

Novel method for controlling Faraday rotation in conductive polymers

Researchers at the University of Tsukuba have developed a novel method for controlling the optical rotation of conductive polymer polythiophene in a magnetic field at low voltage. This method combines the “Faraday rotation” phenomenon, in which a polarizing plane rotates in response to a magnetic field, with the electrochemical oxidation and reduction of conductive polymers.

The study is published in the journal Molecular Crystals and Liquid Crystals.

Conductive polymers possess various properties in addition to conductivity, with applications in light-emitting devices, electromagnetic wave shielding, and anticorrosion materials.

Molecular qubits can communicate at telecom frequencies

A team of scientists from the University of Chicago, the University of California Berkeley, Argonne National Laboratory, and Lawrence Berkeley National Laboratory has developed molecular qubits that bridge the gap between light and magnetism—and operate at the same frequencies as telecommunications technology. The advance, published today in Science, establishes a promising new building block for scalable quantum technologies that can integrate seamlessly with existing fiber-optic networks.

Because the new molecular qubits can interact at telecom-band frequencies, the work points toward future quantum networks—sometimes called the “.” Such networks could enable ultra-secure communication channels, connect quantum computers across long distances, and distribute quantum sensors with unprecedented precision.

Molecular qubits could also serve as highly sensitive quantum sensors; their tiny size and chemical flexibility mean they could be embedded in unusual environments—such as —to measure magnetic fields, temperature, or pressure at the nanoscale. And because they are compatible with silicon photonics, these molecules could be integrated directly into chips, paving the way for compact quantum devices that could be used for computing, communication, or sensing.

Meet Irene Curie, the Nobel-winning atomic physicist who changed the course of modern cancer treatment

The adage goes “like mother like daughter,” and in the case of Irene Joliot-Curie, truer words were never spoken. She was the daughter of two Nobel Prize laureates, Marie Curie and Pierre Curie, and was herself awarded the Nobel Prize in chemistry in 1935 together with her husband, Frederic Joliot.

While her parents received the prize for the discovery of natural radioactivity, Irene’s prize was for the synthesis of artificial radioactivity. This discovery changed many fields of science and many aspects of our everyday lives. Artificial radioactivity is used today in medicine, agriculture, energy production, food sterilization, industrial quality control and more.

We are two nuclear physicists who perform experiments at different accelerator facilities around the world. Irene’s discovery laid the foundation for our experimental studies, which use artificial radioactivity to understand questions related to astrophysics, energy, medicine and more.

Heat-rechargeable computation in DNA logic circuits and neural networks

Heat recharges enzyme-free DNA circuits, enabling complex logic operations and neural networks to perform multiple computations, offering a universal energy source for molecular machines and advancing autonomous behaviours in artificial chemical systems.

Experiment explores contribution of neural, epigenetic and behavioral factors to autism spectrum disorder

Autism spectrum disorder (ASD) is a neurodevelopmental disorder that is estimated to be experienced by roughly 1 in 127 people worldwide. It is characterized by atypical patterns in brain development, which manifest in differences in communication, social interactions, behavior and responses to sensory information.

Past neuroscientific and suggest that a variety of factors contribute to the development of ASD. These can include , chemical alterations that influence the expressions of genes (i.e., epigenetic factors), differences in the structure of specific or neural circuits, and environmental factors, such as early life events or infections or immune responses during pregnancy.

Researchers at the Korea Brain Research Institute and University of Fukui in Japan recently carried out a study aimed at further exploring these different dimensions of ASD, focusing on , the communication between brain regions, epigenetic changes and behavioral patterns. Their findings, published in Translational Psychiatry, paint a clearer picture of the intricate underpinnings of the disorder and could inform the development of more precise tools for diagnosing it.

/* */