Become a Society Member

3-Year SMBE Society Memberships are available for purchase via the "Join or Renew" link below.

Please note that Regular and Student SMBE Society Memberships do not include an MBE print and/or online subscription.

 1-Year Print and/or Online subscriptions to MBE may be purchased in addition to your membership, or separately if you are already an active member.

Questions about membership? Please contact smbe@allenpress.com

Membership Rates

Lifetime Member Worldwide Print&Online


List Price: $500.00
Member Price: $500.00

3-YEAR Society Membership Worldwide No Journal


List Price: $30.00
Member Price: $30.00

3-YEAR Society Membership Student Worldwide No Journal


List Price: $10.00
Member Price: $10.00

@OfficialSMBE Feed

MBE | Most Read

Molecular Biology and Evolution

Society for Molecular Biology and Evolution, Council and Business Meetings, 2019, Manchester, United Kingdom

Mon, 27 Jan 2020 00:00:00 GMT

A Report by the Editor-in-Chief for Molecular Biology and Evolution (MBE), Volume 35

Mon, 27 Jan 2020 00:00:00 GMT

Molecular Biology and Evolution (MBE) is a prominent and highly-respected journal, owned exclusively by the Society for Molecular Biology and Evolution (SMBE) and published by the Oxford University Press. MBE publishes original research at the intersection of Molecular Biology and Evolution, with an emphasis on community resources and tools for analysis that will generate groundbreaking results in evolutionary research. In an increasingly competitive publication landscape, MBE provides society members an opportunity for active participation in the creation of a journal that best suits our community’s needs and interests. Also, the proceeds from MBE have helped to support SMBE annual meetings and community-enhancing activities.

Scientists Identify Rare Evolutionary Intermediates That Help to Understand the Origin of Eukaryotes

Fri, 27 Dec 2019 00:00:00 GMT

A new study by Yale scientists provides a key insight into a milestone event in the early evolution of life on the Earth—the origin of the cell nucleus and complex cells called eukaryotes.

Improved Mapping of Swedish Genes

Fri, 27 Dec 2019 00:00:00 GMT

People—or more specifically just Swedes—are more like chimpanzees than previously known. This is indicated in a genetic mapping of 1,000 Swedish individuals, where new DNA sequences that should be included in the reference genome have been identified. The study is published today in the scientific journal Molecular Biology and Evolution.

Genomic Fluke: Whole-Genome Comparative Analyses of Liver and Intestinal Parasites Reveal Evolutionary History and Shift in Organs, Animals of Choice

Fri, 27 Dec 2019 00:00:00 GMT

Parasitic flukes have been a leading source of food-borne infections, sparking fear and wreaking havoc on human public health, and contributed to more than three billion in animal agricultural losses per year in United States alone.

MBE Emerging Classics 2020

Thu, 21 Nov 2019 00:00:00 GMT

Since 2013, Molecular Biology and Evolution has experimented with criteria to identify “Citation Classics” as a means of celebrating papers with powerful impact on our community as reflected by accumulated citations. We learned from these experiments that our “Early” or “Golden” citation classics remain fairly static from year to year. Some manuscripts have such a strong and lasting impact on our scientific thought and approaches that a yearly affirmation of their influence is unnecessary. Moving forward, MBE will provide yearly recognition to the more recently published manuscripts that have made the strongest impression on scientific literature since their publication, and will revisit the most enduring manuscripts after longer intervals. Below, we highlight ten discovery, five methods and five resource publications as “Emerging Classics” based on citations accrued per fractional years since print publication. Articles are listed from most recent to oldest publication date. Total citation counts were obtained from Web of Science on November 12, 2019. We congratulate these authors on the significance of their contributions and look forward to seeing new classics emerge in the years to come.

Improving Cancer Drug Discovery by Studying Cancer across the Tree of Life

Tue, 05 Nov 2019 00:00:00 GMT

Abstract
Despite a considerable expenditure of time and resources and significant advances in experimental models of disease, cancer research continues to suffer from extremely low success rates in translating preclinical discoveries into clinical practice. The continued failure of cancer drug development, particularly late in the course of human testing, not only impacts patient outcomes, but also drives up the cost for those therapies that do succeed. It is clear that a paradigm shift is necessary if improvements in this process are to occur. One promising direction for increasing translational success is comparative oncology—the study of cancer across species, often involving veterinary patients that develop naturally-occurring cancers. Comparative oncology leverages the power of cross-species analyses to understand the fundamental drivers of cancer protective mechanisms, as well as factors contributing to cancer initiation and progression. Clinical trials in veterinary patients with cancer provide an opportunity to evaluate novel therapeutics in a setting that recapitulates many of the key features of human cancers, including genomic aberrations that underly tumor development, response and resistance to treatment, and the presence of comorbidities that can affect outcomes. With a concerted effort from basic scientists, human physicians and veterinarians, comparative oncology has the potential to enhance the cost-effectiveness and efficiency of pipelines for cancer drug discovery and other cancer treatments.

Reliable Confidence Intervals for RelTime Estimates of Evolutionary Divergence Times

Tue, 22 Oct 2019 00:00:00 GMT

Abstract
Confidence intervals (CIs) depict the statistical uncertainty surrounding evolutionary divergence time estimates. They capture variance contributed by the finite number of sequences and sites used in the alignment, deviations of evolutionary rates from a strict molecular clock in a phylogeny, and uncertainty associated with clock calibrations. Reliable tests of biological hypotheses demand reliable CIs. However, current non-Bayesian methods may produce unreliable CIs because they do not incorporate rate variation among lineages and interactions among clock calibrations properly. Here, we present a new analytical method to calculate CIs of divergence times estimated using the RelTime method, along with an approach to utilize multiple calibration uncertainty densities in dating analyses. Empirical data analyses showed that the new methods produce CIs that overlap with Bayesian highest posterior density intervals. In the analysis of computer-simulated data, we found that RelTime CIs show excellent average coverage probabilities, that is, the actual time is contained within the CIs with a 94% probability. These developments will encourage broader use of computationally efficient RelTime approaches in molecular dating analyses and biological hypothesis testing.

Discovery of Novel Sequences in 1,000 Swedish Genomes

Tue, 24 Sep 2019 00:00:00 GMT

Abstract
Novel sequences (NSs), not present in the human reference genome, are abundant and remain largely unexplored. Here, we utilize de novo assembly to study NS in 1,000 Swedish individuals first sequenced as part of the SweGen project revealing a total of 46 Mb in 61,044 distinct contigs of sequences not present in GRCh38. The contigs were aligned to recently published catalogs of Icelandic and Pan-African NSs, as well as the chimpanzee genome, revealing a great diversity of shared sequences. Analyzing the positioning of NS across the chimpanzee genome, we find that 2,807 NS align confidently within 143 chimpanzee orthologs of human genes. Aligning the whole genome sequencing data to the chimpanzee genome, we discover ancestral NS common throughout the Swedish population. The NSs were searched for repeats and repeat elements: revealing a majority of repetitive sequence (56%), and enrichment of simple repeats (28%) and satellites (15%). Lastly, we align the unmappable reads of a subset of the thousand genomes data to our collection of NS, as well as the previously published Pan-African NS: revealing that both the Swedish and Pan-African NS are widespread, and that the Swedish NSs are largely a subset of the Pan-African NS. Overall, these results highlight the importance of creating a more diverse reference genome and illustrate that significant amounts of the NS may be of ancestral origin.

Stress-Driven Transposable Element De-repression Dynamics and Virulence Evolution in a Fungal Pathogen

Mon, 23 Sep 2019 00:00:00 GMT

Abstract
Transposable elements (TEs) are drivers of genome evolution and affect the expression landscape of the host genome. Stress is a major factor inducing TE activity; however, the regulatory mechanisms underlying de-repression are poorly understood. Plant pathogens are excellent models to dissect the impact of stress on TEs. The process of plant infection induces stress for the pathogen, and virulence factors (i.e., effectors) located in TE-rich regions become expressed. To dissect TE de-repression dynamics and contributions to virulence, we analyzed the TE expression landscape of four strains of the major wheat pathogen Zymoseptoria tritici. We experimentally exposed strains to nutrient starvation and host infection stress. Contrary to expectations, we show that the two distinct conditions induce the expression of different sets of TEs. In particular, the most highly expressed TEs, including miniature inverted-repeat transposable element and long terminal repeat-Gypsy element, show highly distinct de-repression across stress conditions. Both the genomic context of TEs and the genetic background stress (i.e., different strains harboring the same TEs) were major predictors of de-repression under stress. Gene expression profiles under stress varied significantly depending on the proximity to the closest TEs and genomic defenses against TEs were largely ineffective to prevent de-repression. Next, we analyzed the locus encoding the Avr3D1 effector. We show that the insertion and subsequent silencing of TEs in close proximity likely contributed to reduced expression and virulence on a specific wheat cultivar. The complexity of TE responsiveness to stress across genetic backgrounds and genomic locations demonstrates substantial intraspecific genetic variation to control TEs with consequences for virulence.

Comparative Transcriptomics Analyses across Species, Organs, and Developmental Stages Reveal Functionally Constrained lncRNAs

Fri, 20 Sep 2019 00:00:00 GMT

Abstract
The functionality of long noncoding RNAs (lncRNAs) is disputed. In general, lncRNAs are under weak selective pressures, suggesting that the majority of lncRNAs may be nonfunctional. However, although some surveys showed negligible phenotypic effects upon lncRNA perturbation, key biological roles were demonstrated for individual lncRNAs. Most lncRNAs with proven functions were implicated in gene expression regulation, in pathways related to cellular pluripotency, differentiation, and organ morphogenesis, suggesting that functional lncRNAs may be more abundant in embryonic development, rather than in adult organs. To test this hypothesis, we perform a multidimensional comparative transcriptomics analysis, across five developmental time points (two embryonic stages, newborn, adult, and aged individuals), four organs (brain, kidney, liver, and testes), and three species (mouse, rat, and chicken). We find that, overwhelmingly, lncRNAs are preferentially expressed in adult and aged testes, consistent with the presence of permissive transcription during spermatogenesis. LncRNAs are often differentially expressed among developmental stages and are less abundant in embryos and newborns compared with adult individuals, in agreement with a requirement for tighter expression control and less tolerance for noisy transcription early in development. For differentially expressed lncRNAs, we find that the patterns of expression variation among developmental stages are generally conserved between mouse and rat. Moreover, lncRNAs expressed above noise levels in somatic organs and during development show higher evolutionary conservation, in particular, at their promoter regions. Thus, we show that functionally constrained lncRNA loci are enriched in developing organs, and we suggest that many of these loci may function in an RNA-independent manner.

The Genetic Background Modulates the Evolution of Fluoroquinolone-Resistance in Mycobacterium tuberculosis

Wed, 18 Sep 2019 00:00:00 GMT

Abstract
Fluoroquinolones (FQ) form the backbone in experimental treatment regimens against drug-susceptible tuberculosis. However, little is known on whether the genetic variation present in natural populations of Mycobacterium tuberculosis (Mtb) affects the evolution of FQ-resistance (FQ-R). To investigate this question, we used nine genetically distinct drug-susceptible clinical isolates of Mtb and measured their frequency of resistance to the FQ ofloxacin (OFX) in vitro. We found that the Mtb genetic background led to differences in the frequency of OFX-resistance (OFX-R) that spanned two orders of magnitude and substantially modulated the observed mutational profiles for OFX-R. Further, in vitro assays showed that the genetic background also influenced the minimum inhibitory concentration and the fitness effect conferred by a given OFX-R mutation. To test the clinical relevance of our in vitro work, we surveyed the mutational profile for FQ-R in publicly available genomic sequences from clinical Mtb isolates, and found substantial Mtb lineage-dependent variability. Comparison of the clinical and the in vitro mutational profiles for FQ-R showed that 51% and 39% of the variability in the clinical frequency of FQ-R gyrA mutation events in Lineage 2 and Lineage 4 strains, respectively, can be attributed to how Mtb evolves FQ-R in vitro. As the Mtb genetic background strongly influenced the evolution of FQ-R in vitro, we conclude that the genetic background of Mtb also impacts the evolution of FQ-R in the clinic.

The Genome of the Endangered Dryas Monkey Provides New Insights into the Evolutionary History of the Vervets

Mon, 16 Sep 2019 00:00:00 GMT

Abstract
Genomic data can be a powerful tool for inferring ecology, behavior, and conservation needs of highly elusive species, particularly, when other sources of information are hard to come by. Here, we focus on the Dryas monkey (Cercopithecus dryas), an endangered primate endemic to the Congo Basin with cryptic behavior and possibly <250 remaining adult individuals. Using whole-genome sequencing data, we show that the Dryas monkey represents a sister lineage to the vervets (Chlorocebus sp.) and has diverged from them ∼1.4 Ma with additional bidirectional gene flow ∼750,000–∼500,000 years ago that has likely involved the crossing of the Congo River. Together with evidence of gene flow across the Congo River in bonobos and okapis, our results suggest that the fluvial topology of the Congo River might have been more dynamic than previously recognized. Despite the presence of several homozygous loss-of-function mutations in genes associated with sperm mobility and immunity, we find high genetic diversity and low levels of inbreeding and genetic load in the studied Dryas monkey individual. This suggests that the current population carries sufficient genetic variability for long-term survival and might be larger than currently recognized. We thus provide an example of how genomic data can directly improve our understanding of highly elusive species.

Patterns of Genomic Differentiation in the Drosophila nasuta Species Complex

Sun, 15 Sep 2019 00:00:00 GMT

Abstract
The Drosophila nasuta species complex contains over a dozen recently diverged species that are distributed widely across South-East Asia, and which shows varying degrees of pre- and postzygotic isolation. Here, we assemble a high-quality genome for D. albomicans using single-molecule sequencing and chromatin conformation capture, and draft genomes for 11 additional species and 67 individuals across the clade, to infer the species phylogeny and patterns of genetic diversity in this group. Our assembly recovers entire chromosomes, and we date the origin of this radiation ∼2 Ma. Despite low levels of overall differentiation, most species or subspecies show clear clustering into their designated taxonomic groups using population genetics and phylogenetic methods. Local evolutionary history is heterogeneous across the genome, and differs between the autosomes and the X chromosome for species in the sulfurigaster subgroup, likely due to autosomal introgression. Our study establishes the nasuta species complex as a promising model system to further characterize the evolution of pre- and postzygotic isolation in this clade.

Recombining Your Way Out of Trouble: The Genetic Architecture of Hybrid Fitness under Environmental Stress

Fri, 13 Sep 2019 00:00:00 GMT

Abstract
Hybridization between species can either promote or impede adaptation. But we know very little about the genetic basis of hybrid fitness, especially in nondomesticated organisms, and when populations are facing environmental stress. We made genetically variable F2 hybrid populations from two divergent Saccharomyces yeast species. We exposed populations to ten toxins and sequenced the most resilient hybrids on low coverage using ddRADseq to investigate four aspects of their genomes: 1) hybridity, 2) interspecific heterozygosity, 3) epistasis (positive or negative associations between nonhomologous chromosomes), and 4) ploidy. We used linear mixed-effect models and simulations to measure to which extent hybrid genome composition was contingent on the environment. Genomes grown in different environments varied in every aspect of hybridness measured, revealing strong genotype–environment interactions. We also found selection against heterozygosity or directional selection for one of the parental alleles, with larger fitness of genomes carrying more homozygous allelic combinations in an otherwise hybrid genomic background. In addition, individual chromosomes and chromosomal interactions showed significant species biases and pervasive aneuploidies. Against our expectations, we observed multiple beneficial, opposite-species chromosome associations, confirmed by epistasis- and selection-free computer simulations, which is surprising given the large divergence of parental genomes (∼15%). Together, these results suggest that successful, stress-resilient hybrid genomes can be assembled from the best features of both parents without paying high costs of negative epistasis. This illustrates the importance of measuring genetic trait architecture in an environmental context when determining the evolutionary potential of genetically diverse hybrid populations.

Gene Expression Modularity Reveals Footprints of Polygenic Adaptation in Theobroma cacao

Thu, 12 Sep 2019 00:00:00 GMT

Abstract
Separating footprints of adaptation from demography is challenging. When selection has acted on a single locus with major effect, this issue can be alleviated through signatures left by selective sweeps. However, as adaptation is often driven by small allele frequency shifts at many loci, studies focusing on single genes are able to identify only a small portion of genomic variants responsible for adaptation. In face of this challenge, we utilize coexpression information to search for signals of polygenetic adaptation in Theobroma cacao, a tropical tree species that is the source of chocolate. Using transcriptomics and a weighted correlation network analysis, we group genes with similar expression patterns into functional modules. We then ask whether modules enriched for specific biological processes exhibit cumulative effects of differential selection in the form of high FST and dXY between populations. Indeed, modules putatively involved in protein modification, flowering, and water transport show signs of polygenic adaptation even though individual genes that are members of those groups do not bear strong signatures of selection. Modeling of demography, background selection, and the effects of genomic features reveal that these patterns are unlikely to arise by chance. We also find that specific modules are enriched for signals of strong or relaxed purifying selection, with one module bearing signs of adaptive differentiation and an excess of deleterious mutations. Our results provide insight into polygenic adaptation and contribute to understanding of population structure, demographic history, and genome evolution in T. cacao.

Genomic and Phenotypic Analyses Reveal Mechanisms Underlying Homing Ability in Pigeon

Tue, 10 Sep 2019 00:00:00 GMT

Abstract
The homing pigeon was selectively bred from the domestic pigeon for a homing ability over long distances, a very fascinating but complex behavioral trait. Here, we generate a total of 95 whole genomes from diverse pigeon breeds. Comparing the genomes from the homing pigeon population with those from other breeds identifies candidate positively selected genes, including many genes involved in the central nervous system, particularly spatial learning and memory such as LRP8. Expression profiling reveals many neuronal genes displaying differential expression in the hippocampus, which is the key organ for memory and navigation and exhibits significantly larger size in the homing pigeon. In addition, we uncover a candidate gene GSR (encoding glutathione-disulfide reductase) experiencing positive selection in the homing pigeon. Expression profiling finds that GSR is highly expressed in the wattle and visual pigment cell layer, and displays increased expression levels in the homing pigeon. In vitro, a magnetic field stimulates increases in calcium ion concentration in cells expressing pigeon GSR. These findings support the importance of the hippocampus (functioning in spatial memory and navigation) for homing ability, and the potential involvement of GSR in pigeon magnetoreception.

Archaeal Ribosomal Proteins Possess Nuclear Localization Signal-Type Motifs: Implications for the Origin of the Cell Nucleus

Tue, 10 Sep 2019 00:00:00 GMT

Abstract
Eukaryotic cells are divided into the nucleus and the cytosol, and, to enter the nucleus, proteins typically possess short signal sequences, known as nuclear localization signals (NLSs). Although NLSs have long been considered as features unique to eukaryotic proteins, we show here that similar or identical protein segments are present in ribosomal proteins from the Archaea. Specifically, the ribosomal proteins uL3, uL15, uL18, and uS12 possess NLS-type motifs that are conserved across all major branches of the Archaea, including the most ancient groups Microarchaeota and Diapherotrites, pointing to the ancient origin of NLS-type motifs in the Archaea. Furthermore, by using fluorescence microscopy, we show that the archaeal NLS-type motifs can functionally substitute eukaryotic NLSs and direct the transport of ribosomal proteins into the nuclei of human cells. Collectively, these findings illustrate that the origin of NLSs preceded the origin of the cell nucleus, suggesting that the initial function of NLSs was not related to intracellular trafficking, but possibly was to improve recognition of nucleic acids by cellular proteins. Overall, our study reveals rare evolutionary intermediates among archaeal cells that can help elucidate the sequence of events that led to the origin of the eukaryotic cell.

Adaptive Radiation of the Flukes of the Family Fasciolidae Inferred from Genome-Wide Comparisons of Key Species

Tue, 10 Sep 2019 00:00:00 GMT

Abstract
Liver and intestinal flukes of the family Fasciolidae cause zoonotic food–borne infections that impact both agriculture and human health throughout the world. Their evolutionary history and the genetic basis underlying their phenotypic and ecological diversity are not well understood. To close that knowledge gap, we compared the whole genomes of Fasciola hepatica, Fasciola gigantica, and Fasciolopsis buski and determined that the split between Fasciolopsis and Fasciola took place ∼90 Ma in the late Cretaceous period, and that between 65 and 50 Ma an intermediate host switch and a shift from intestinal to hepatic habitats occurred in the Fasciola lineage. The rapid climatic and ecological changes occurring during this period may have contributed to the adaptive radiation of these flukes. Expansion of cathepsins, fatty-acid-binding proteins, protein disulfide-isomerases, and molecular chaperones in the genus Fasciola highlights the significance of excretory–secretory proteins in these liver-dwelling flukes. Fasciola hepatica and Fasciola gigantica diverged ∼5 Ma near the Miocene–Pliocene boundary that coincides with reduced faunal exchange between Africa and Eurasia. Severe decrease in the effective population size ∼10 ka in Fasciola is consistent with a founder effect associated with its recent global spread through ruminant domestication. G-protein-coupled receptors may have key roles in adaptation of physiology and behavior to new ecological niches. This study has provided novel insights about the genome evolution of these important pathogens, has generated genomic resources to enable development of improved interventions and diagnosis, and has laid a solid foundation for genomic epidemiology to trace drug resistance and to aid surveillance.

“Lost and Found”: snoRNA Annotation in the Xenopus Genome and Implications for Evolutionary Studies

Fri, 06 Sep 2019 00:00:00 GMT

Abstract
Small nucleolar RNAs (snoRNAs) function primarily as guide RNAs for posttranscriptional modification of rRNAs and spliceosomal snRNAs, both of which are functionally important and evolutionarily conserved molecules. It is commonly believed that snoRNAs and the modifications they mediate are highly conserved across species. However, most relevant data on snoRNA annotation and RNA modification are limited to studies on human and yeast. Here, we used RNA-sequencing data from the giant oocyte nucleus of the frog Xenopus tropicalis to annotate a nearly complete set of snoRNAs. We compared the frog data with snoRNA sets from human and other vertebrate genomes, including mammals, birds, reptiles, and fish. We identified many Xenopus-specific (or nonhuman) snoRNAs and Xenopus-specific domains in snoRNAs from conserved RNA families. We predicted that some of these nonhuman snoRNAs and domains mediate modifications at unexpected positions in rRNAs and snRNAs. These modifications were mapped as predicted when RNA modification assays were applied to RNA from nine vertebrate species: frogs X. tropicalis and X. laevis, newt Notophthalmus viridescens, axolotl Ambystoma mexicanum, whiptail lizard Aspidoscelis neomexicana, zebrafish Danio rerio, chicken, mouse, and human. This analysis revealed that only a subset of RNA modifications is evolutionarily conserved and that modification patterns may vary even between closely related species. We speculate that each functional domain in snoRNAs (half of an snoRNA) may evolve independently and shuffle between different snoRNAs.

Evolutionary History of GLIS Genes Illuminates Their Roles in Cell Reprograming and Ciliogenesis

Thu, 05 Sep 2019 00:00:00 GMT

Abstract
The GLIS family transcription factors, GLIS1 and GLIS3, potentiate generation of induced pluripotent stem cells (iPSCs). In contrast, another GLIS family member, GLIS2, suppresses cell reprograming. To understand how these disparate roles arose, we examined evolutionary origins and genomic organization of GLIS genes. Comprehensive phylogenetic analysis shows that GLIS1 and GLIS3 originated during vertebrate whole genome duplication, whereas GLIS2 is a sister group to the GLIS1/3 and GLI families. This result is consistent with their opposing functions in cell reprograming. Glis1 evolved faster than Glis3, losing many protein-interacting motifs. This suggests that Glis1 acquired new functions under weakened evolutionary constraints. In fact, GLIS1 induces induced pluripotent stem cells more strongly. Transcriptomic data from various animal embryos demonstrate that glis1 is maternally expressed in some tetrapods, whereas vertebrate glis3 and invertebrate glis1/3 genes are rarely expressed in oocytes, suggesting that vertebrate (or tetrapod) Glis1 acquired a new expression domain and function as a maternal factor. Furthermore, comparative genomic analysis reveals that glis1/3 is part of a bilaterian-specific gene cluster, together with rfx3, ndc1, hspb11, and lrrc42. Because known functions of these genes are related to cilia formation and function, the last common ancestor of bilaterians may have acquired this cluster by shuffling gene order to establish more sophisticated epithelial tissues involving cilia. This evolutionary study highlights the significance of GLIS1/3 for cell reprograming, development, and diseases in ciliated organs such as lung, kidney, and pancreas.

Can the RNA World Still Function without Cytidine?

Wed, 04 Sep 2019 00:00:00 GMT

Abstract
Most scenarios for the origin of life assume that RNA played a key role in both catalysis and information storage. The A, U, G, and C nucleobases in modern RNA all participate in secondary structure formation and replication. However, the rapid deamination of C to U and the absence of C in meteorite samples suggest that prebiotic RNA may have been deficient in cytosine. Here, we assess the ability of RNA sequences formed from a three-letter AUG alphabet to perform both structural and genetic roles in comparison to sequences formed from the AUGC alphabet. Despite forming less thermodynamically stable helices, the AUG alphabet can find a broad range of structures and thus appears sufficient for catalysis in the RNA World. However, in the AUG case, longer sequences are required to form structures with an equivalent complexity. Replication in the AUG alphabet requires GU pairing. Sequence fidelity in the AUG alphabet is low whenever G’s are present in the sequence. We find that AUG sequences evolve to AU sequences if GU pairing is rare, and to RU sequences if GU pairing is common (R denotes A or G). It is not possible to conserve a G at a specific site in either case. These problems do not rule out the possibility of an RNA World based on AUG, but they show that it wouldbe significantly more difficult than with a four-base alphabet.

Stochastic Gene Expression Influences the Selection of Antibiotic Resistance Mutations

Wed, 04 Sep 2019 00:00:00 GMT

Abstract
Bacteria can resist antibiotics by expressing enzymes that remove or deactivate drug molecules. Here, we study the effects of gene expression stochasticity on efflux and enzymatic resistance. We construct an agent-based model that stochastically simulates multiple biochemical processes in the cell and we observe the growth and survival dynamics of the cell population. Resistance-enhancing mutations are introduced by varying parameters that control the enzyme expression or efficacy. We find that stochastic gene expression can cause complex dynamics in terms of survival and extinction for these mutants. Regulatory mutations, which augment the frequency and duration of resistance gene transcription, can provide limited resistance by increasing mean expression. Structural mutations, which modify the enzyme or efflux efficacy, provide most resistance by improving the binding affinity of the resistance protein to the antibiotic; increasing the enzyme’s catalytic rate alone may contribute to resistance if drug binding is not rate limiting. Overall, we identify conditions where regulatory mutations are selected over structural mutations, and vice versa. Our findings show that stochastic gene expression is a key factor underlying efflux and enzymatic resistances and should be taken into consideration in future antibiotic research.

Polymorphism Data Assist Estimation of the Nonsynonymous over Synonymous Fixation Rate Ratio ω for Closely Related Species

Fri, 30 Aug 2019 00:00:00 GMT

Abstract
The ratio of nonsynonymous over synonymous sequence divergence, dN/dS, is a widely used estimate of the nonsynonymous over synonymous fixation rate ratio ω, which measures the extent to which natural selection modulates protein sequence evolution. Its computation is based on a phylogenetic approach and computes sequence divergence of protein-coding DNA between species, traditionally using a single representative DNA sequence per species. This approach ignores the presence of polymorphisms and relies on the indirect assumption that new mutations fix instantaneously, an assumption which is generally violated and reasonable only for distantly related species. The violation of the underlying assumption leads to a time-dependence of sequence divergence, and biased estimates of ω in particular for closely related species, where the contribution of ancestral and lineage-specific polymorphisms to sequence divergence is substantial. We here use a time-dependent Poisson random field model to derive an analytical expression of dN/dS as a function of divergence time and sample size. We then extend our framework to the estimation of the proportion of adaptive protein evolution α. This mathematical treatment enables us to show that the joint usage of polymorphism and divergence data can assist the inference of selection for closely related species. Moreover, our analytical results provide the basis for a protocol for the estimation of ω and α for closely related species. We illustrate the performance of this protocol by studying a population data set of four corvid species, which involves the estimation of ω and α at different time-scales and for several choices of sample sizes.

Legacy Data Confound Genomics Studies

Fri, 30 Aug 2019 00:00:00 GMT

Abstract
Recent reports have identified differences in the mutational spectra across human populations. Although some of these reports have been replicated in other cohorts, most have been reported only in the 1000 Genomes Project (1kGP) data. While investigating an intriguing putative population stratification within the Japanese population, we identified a previously unreported batch effect leading to spurious mutation calls in the 1kGP data and to the apparent population stratification. Because the 1kGP data are used extensively, we find that the batch effects also lead to incorrect imputation by leading imputation servers and a small number of suspicious GWAS associations. Lower quality data from the early phases of the 1kGP thus continue to contaminate modern studies in hidden ways. It may be time to retire or upgrade such legacy sequencing data.

Ancient Mitochondrial Gene Transfer between Fungi and the Orchids

Thu, 29 Aug 2019 00:00:00 GMT

Abstract
The mitochondrial genomes (mitogenomes) of plants are known to incorporate and accumulate DNA from intra- and extracellular donors. Despite the intimate relationships formed between flowing plants (angiosperms) and fungi, lengthy fungal-like sequence has not been identified in angiosperm mitogenomes to date. Here, we present multiple lines of evidence documenting horizontal gene transfer (HGT) between the mitogenomes of fungi and the ancestors of the orchids, plants that are obligate parasites of fungi during their early development. We show that the ancestor of the orchids acquired an ∼270-bp fungal mitogenomic region containing three transfer RNA genes. We propose that the short HGT was later replaced by a second HGT event transferring >8 kb and 14 genes from a fungal mitogenome to that of the ancestor of the largest orchid subfamily, Epidendroideae. Our results represent the first evidence of genomic-scale HGT between fungal and angiosperm mitogenomes and demonstrate that the length intergenic spacer regions of angiosperm mitogenomes can effectively fossilize the genomic remains of ancient, nonplant organisms.

Phylogenetic Shifts in Gene Body Methylation Correlate with Gene Expression and Reflect Trait Conservation

Tue, 27 Aug 2019 00:00:00 GMT

Abstract
A subset of genes in plant genomes are labeled with DNA methylation specifically at CG residues. These genes, known as gene-body methylated (gbM), have a number of associated characteristics. They tend to have longer sequences, to be enriched for intermediate expression levels, and to be associated with slower rates of molecular evolution. Most importantly, gbM genes tend to maintain their level of DNA methylation between species, suggesting that this trait is under evolutionary constraint. Given the degree of conservation in gbM, we still know surprisingly little about its function in plant genomes or whether gbM is itself a target of selection. To address these questions, we surveyed DNA methylation across eight grass (Poaceae) species that span a gradient of genome sizes. We first established that genome size correlates with genome-wide DNA methylation levels, but less so for genic levels. We then leveraged genomic data to identify a set of 2,982 putative orthologs among the eight species and examined shifts of methylation status for each ortholog in a phylogenetic context. A total of 55% of orthologs exhibited a shift in gbM, but these shifts occurred predominantly on terminal branches, indicating that shifts in gbM are rarely conveyed over time. Finally, we found that the degree of conservation of gbM across species is associated with increased gene length, reduced rates of molecular evolution, and increased gene expression level, but reduced gene expression variation across species. Overall, these observations suggest a basis for evolutionary pressure to maintain gbM status over evolutionary time.

HyPhy 2.5—A Customizable Platform for Evolutionary Hypothesis Testing Using Phylogenies

Tue, 27 Aug 2019 00:00:00 GMT

Abstract
HYpothesis testing using PHYlogenies (HyPhy) is a scriptable, open-source package for fitting a broad range of evolutionary models to multiple sequence alignments, and for conducting subsequent parameter estimation and hypothesis testing, primarily in the maximum likelihood statistical framework. It has become a popular choice for characterizing various aspects of the evolutionary process: natural selection, evolutionary rates, recombination, and coevolution. The 2.5 release (available from www.hyphy.org) includes a completely re-engineered computational core and analysis library that introduces new classes of evolutionary models and statistical tests, delivers substantial performance and stability enhancements, improves usability, streamlines end-to-end analysis workflows, makes it easier to develop custom analyses, and is mostly backward compatible with previous HyPhy releases.

ModelTest-NG: A New and Scalable Tool for the Selection of DNA and Protein Evolutionary Models

Wed, 21 Aug 2019 00:00:00 GMT

Abstract
ModelTest-NG is a reimplementation from scratch of jModelTest and ProtTest, two popular tools for selecting the best-fit nucleotide and amino acid substitution models, respectively. ModelTest-NG is one to two orders of magnitude faster than jModelTest and ProtTest but equally accurate and introduces several new features, such as ascertainment bias correction, mixture, and free-rate models, or the automatic processing of single partitions. ModelTest-NG is available under a GNU GPL3 license at https://github.com/ddarriba/modeltest , last accessed September 2, 2019.

GBE | Most Read

Genome Biology & Evolution

Highlight—Blind as a Bat? The Genetic Basis of Echolocation in Bats and Whales

Mon, 27 Jan 2020 00:00:00 GMT

Clicks, squeaks, chirps, buzzes, etc. though they may be difficult to distinguish to our ears, such sounds are used by echolocating animals to paint a vivid picture of their surroundings. By generating a sound and then listening to how the sound waves bounce off of objects around them, these animals are able to “see” using sound. Although a number of species engage in some form of echolocation, including some birds, shrews, and even humans, the echolocation systems of bats and toothed whales (including dolphins, porpoises, killer whales, and sperm whales) are exquisitely sophisticated. Echolocation evolved independently in these animals (fig. 1) under conditions of poor visibility—the night sky for bats and deep underwater for toothed whales—enabling them to hunt for prey and navigate in complete darkness. It is a fascinating example of convergent evolution, the process by which distantly related organisms evolve similar features or adaptations. To better understand how echolocation evolved in these species, a new study in Genome Biology and Evolution, titled “Evolutionary basis of high-frequency hearing in the cochleae of echolocators revealed by comparative genomics,” takes advantage of advances in genomic analysis to investigate the origin and evolution of high-frequency hearing, an adaptation that allows echolocators to perceive ultrasonic signals.

Differential Expression in Testis and Liver Transcriptomes from Four Species of Peromyscus (Rodentia: Cricetidae)

Tue, 07 Jan 2020 00:00:00 GMT

Abstract
The genus Peromyscus represents a rapidly diverged clade of Cricetid rodents that contains multiple cryptic species and has a propensity for morphologic conservation across its members. The unresolved relationships in previously proposed phylogenies reflect a suspected rapid adaptive radiation. To identify functional groups of genes that may be important in reproductive isolation in a reoccurring fashion across the Peromyscus phylogeny, liver and testis transcriptomes from four species (P. attwateri, P. boylii, P. leucopus, and P. maniculatus) were generated and differential expression (DE) tests were conducted. Taxa were selected to represent members diverged from a common ancestor: P. attwateri + P. boylii (clade A), and P. leucopus + P. maniculatus (clade B). Comparison of clades (A vs. B) suggested that 252 transcripts had significant DE in the liver data set, whereas significant DE was identified for 657 transcripts in the testis data set. Further, 45 genes had DE isoforms in the 657 testis transcripts and most of these functioned in major reproductive roles such as acrosome assembly, spermatogenesis, and cell cycle processes (meiosis). DE transcripts in the liver mapped to more broad gene ontology terms (metabolic processes, catabolic processes, response to chemical, and regulatory processes), and DE transcripts in the testis mapped to gene ontology terms associated with reproductive processes, such as meiosis, sperm motility, acrosome assembly, and sperm–egg fusion. These results suggest that a suite of genes that conduct similar functions in the testes may be responsible for the adaptive radiation events and potential reoccurring speciation of Peromyscus in terms of reproduction through varying expression levels.

Whole Genome Sequencing and Assembly of the Asian Honey Bee Apis dorsata

Fri, 20 Dec 2019 00:00:00 GMT

Abstract
The Asian honey bee (Apis dorsata) is distinct from its more widely distributed cousin Apis mellifera by a few key characteristics. Most prominently, A. dorsata, nest in the open by forming a colony clustered around the honeycomb, whereas A. mellifera nest in concealed cavities. Additionally, the worker and reproductive castes are all of the same size in A. dorsata. In order to investigate these differences, we performed whole genome sequencing of A. dorsata using a hybrid Oxford Nanopore and Illumina approach. The 223 Mb genome has an N50 of 35 kb with the largest scaffold of 302 kb. We have found that there are many genes in the dorsata genome that are distinct from other hymenoptera and also large amounts of transposable elements, and we suggest some candidate genes for A. dorsata’s exceptional level of defensive aggression.

Comparative Plastome Analysis of Root- and Stem-Feeding Parasites of Santalales Untangle the Footprints of Feeding Mode and Lifestyle Transitions

Tue, 17 Dec 2019 00:00:00 GMT

Abstract
In plants, parasitism triggers the reductive evolution of plastid genomes (plastomes). To disentangle the molecular evolutionary associations between feeding on other plants below- or aboveground and general transitions from facultative to obligate parasitism, we analyzed 34 complete plastomes of autotrophic, root- and stem-feeding hemiparasitic, and holoparasitic Santalales. We observed inexplicable losses of housekeeping genes and tRNAs in hemiparasites and dramatic genomic reconfiguration in holoparasitic Balanophoraceae, whose plastomes have exceptionally low GC contents. Genomic changes are related primarily to the evolution of hemi- or holoparasitism, whereas the transition from a root- to a stem-feeding mode plays no major role. In contrast, the rate of molecular evolution accelerates in a stepwise manner from autotrophs to root- and then stem-feeding parasites. Already the ancestral transition to root-parasitism coincides with a relaxation of selection in plastomes. Another significant selectional shift in plastid genes occurs as stem-feeders evolve, suggesting that this derived form coincides with trophic specialization despite the retention of photosynthetic capacity. Parasitic Santalales fill a gap in our understanding of parasitism-associated plastome degeneration. We reveal that lifestyle-genome associations unfold interdependently over trophic specialization and feeding mode transitions, where holoparasitic Balanophoraceae provide a system for exploring the functional realms of plastomes.

Genome Sequences of 72 Bacterial Strains Isolated from Ectocarpus subulatus: A Resource for Algal Microbiology

Mon, 16 Dec 2019 00:00:00 GMT

Abstract
Brown algae are important primary producers and ecosystem engineers in the ocean, and Ectocarpus has been established as a laboratory model for this lineage. Like most multicellular organisms, Ectocarpus is associated with a community of microorganisms, a partnership frequently referred to as holobiont due to the tight interconnections between the components. Although genomic resources for the algal host are well established, its associated microbiome is poorly characterized from a genomic point of view, limiting the possibilities of using these types of data to study host–microbe interactions. To address this gap in knowledge, we present the annotated draft genome sequences of seventy-two cultivable Ectocarpus-associated bacteria. A screening of gene clusters related to the production of secondary metabolites revealed terpene, bacteriocin, NRPS, PKS-t3, siderophore, PKS-t1, and homoserine lactone clusters to be abundant among the sequenced genomes. These compounds may be used by the bacteria to communicate with the host and other microbes. Moreover, detoxification and provision of vitamin B pathways have been observed in most sequenced genomes, highlighting potential contributions of the bacterial metabolism toward host fitness and survival. The genomes sequenced in this study form a valuable resource for comparative genomic analyses and evolutionary surveys of alga-associated bacteria. They help establish Ectocarpus as a model for brown algal holobionts and will enable the research community to produce testable hypotheses about the molecular interactions within this complex system.

An Annotated Draft Genome of the Mountain Hare (Lepus timidus)

Fri, 13 Dec 2019 00:00:00 GMT

Abstract
Hares (genus Lepus) provide clear examples of repeated and often massive introgressive hybridization and striking local adaptations. Genomic studies on this group have so far relied on comparisons to the European rabbit (Oryctolagus cuniculus) reference genome. Here, we report the first de novo draft reference genome for a hare species, the mountain hare (Lepus timidus), and evaluate the efficacy of whole-genome re-sequencing analyses using the new reference versus using the rabbit reference genome. The genome was assembled using the ALLPATHS-LG protocol with a combination of overlapping pair and mate-pair Illumina sequencing (77x coverage). The assembly contained 32,294 scaffolds with a total length of 2.7 Gb and a scaffold N50 of 3.4 Mb. Re-scaffolding based on the rabbit reference reduced the total number of scaffolds to 4,205 with a scaffold N50 of 194 Mb. A correspondence was found between 22 of these hare scaffolds and the rabbit chromosomes, based on gene content and direct alignment. We annotated 24,578 protein coding genes by combining ab-initio predictions, homology search, and transcriptome data, of which 683 were solely derived from hare-specific transcriptome data. The hare reference genome is therefore a new resource to discover and investigate hare-specific variation. Similar estimates of heterozygosity and inferred demographic history profiles were obtained when mapping hare whole-genome re-sequencing data to the new hare draft genome or to alternative references based on the rabbit genome. Our results validate previous reference-based strategies and suggest that the chromosome-scale hare draft genome should enable chromosome-wide analyses and genome scans on hares.

Emergence and Evolution of ERM Proteins and Merlin in Metazoans

Thu, 12 Dec 2019 00:00:00 GMT

Abstract
Ezrin, radixin, moesin, and merlin are cytoskeletal proteins, whose functions are specific to metazoans. They participate in cell cortex rearrangement, including cell–cell contact formation, and play an important role in cancer progression. Here, we have performed a comprehensive phylogenetic analysis of the proteins spanning 87 species. The results describe a possible mechanism for the protein family origin in the root of Metazoa, paralogs diversification in vertebrates, and acquisition of novel functions, including tumor suppression. In addition, a merlin paralog, present in most vertebrates but lost in mammals, has been described here for the first time. We have also highlighted a set of amino acid variations within the conserved motifs as the candidates for determining physiological differences between ERM paralogs.

Microsporidia with Vertical Transmission Were Likely Shaped by Nonadaptive Processes

Wed, 11 Dec 2019 00:00:00 GMT

Abstract
Microsporidia have the leanest genomes among eukaryotes, and their physiological and genomic simplicity has been attributed to their intracellular, obligate parasitic life-style. However, not all microsporidia genomes are small or lean, with the largest dwarfing the smallest ones by at least an order of magnitude. To better understand the evolutionary mechanisms behind this genomic diversification, we explore here two clades of microsporidia with distinct life histories, Ordospora and Hamiltosporidium, parasitizing the same host species, Daphnia magna. Based on seven newly assembled genomes, we show that mixed-mode transmission (the combination of horizontal and vertical transmission), which occurs in Hamiltosporidium, is found to be associated with larger and AT-biased genomes, more genes, and longer intergenic regions, as compared with the exclusively horizontally transmitted Ordospora. Furthermore, the Hamiltosporidium genome assemblies contain a variety of repetitive elements and long segmental duplications. We show that there is an excess of nonsynonymous substitutions in the microsporidia with mixed-mode transmission, which cannot be solely attributed to the lack of recombination, suggesting that bursts of genome size in these microsporidia result primarily from genetic drift. Overall, these findings suggest that the switch from a horizontal-only to a mixed mode of transmission likely produces population bottlenecks in Hamiltosporidium species, therefore reducing the effectiveness of natural selection, and allowing their genomic features to be largely shaped by nonadaptive processes.

A High-Quality Reference Genome Assembly of the Saltwater Crocodile, Crocodylus porosus, Reveals Patterns of Selection in Crocodylidae

Tue, 10 Dec 2019 00:00:00 GMT

Abstract
Crocodilians are an economically, culturally, and biologically important group. To improve researchers’ ability to study genome structure, evolution, and gene regulation in the clade, we generated a high-quality de novo genome assembly of the saltwater crocodile, Crocodylus porosus, from Illumina short read data from genomic libraries and in vitro proximity-ligation libraries. The assembled genome is 2,123.5 Mb, with N50 scaffold size of 17.7 Mb and N90 scaffold size of 3.8 Mb. We then annotated this new assembly, increasing the number of annotated genes by 74%. In total, 96% of 23,242 annotated genes were associated with a functional protein domain. Furthermore, multiple noncoding functional regions and mappable genetic markers were identified. Upon analysis and overlapping the results of branch length estimation and site selection tests for detecting potential selection, we found 16 putative genes under positive selection in crocodilians, 10 in C. porosus and 6 in Alligator mississippiensis. The annotated C. porosus genome will serve as an important platform for osmoregulatory, physiological, and sex determination studies, as well as an important reference in investigating the phylogenetic relationships of crocodilians, birds, and other tetrapods.

Evolutionary History of the Toll-Like Receptor Gene Family across Vertebrates

Wed, 04 Dec 2019 00:00:00 GMT

Abstract
Adaptation to a wide range of pathogenic environments is a major aspect of the ecological adaptations of vertebrates during evolution. Toll-like receptors (TLRs) are ancient membrane-bound sensors in animals and are best known for their roles in detecting and defense against invading pathogenic microorganisms. To understand the evolutionary history of the vertebrate TLR gene family, we first traced the origin of single-cysteine cluster TLRs that share the same protein architecture with vertebrate TLRs in early-branching animals and then analyzed all members of the TLR family in over 200 species covering all major vertebrate clades. Our results indicate that although the emergence of single-cysteine cluster TLRs predates the separation of bilaterians and cnidarians, most vertebrate TLR members originated shortly after vertebrate emergence. Phylogenetic analyses divided 1,726 vertebrate TLRs into 8 subfamilies, and TLR3 may represent the most ancient subfamily that emerged before the branching of deuterostomes. Our analysis reveals that purifying selection predominated in the evolution of all vertebrate TLRs, with mean dN/dS (ω) values ranging from 0.082 for TLR21 in birds to 0.434 for TLR11 in mammals. However, we did observe patterns of positive selection acting on specific codons (527 of 60,294 codons across all vertebrate TLRs, 8.7‰), which are significantly concentrated in ligand-binding extracellular domains and suggest host–pathogen coevolutionary interactions. Additionally, we found stronger positive selection acting on nonviral compared with viral TLRs, indicating the more essential nonredundant function of viral TLRs in host immunity. Taken together, our findings provide comprehensive insight into the complex evolutionary processes of the vertebrate TLR gene family, involving gene duplication, pseudogenization, purification, and positive selection.

The Genome of the Blind Soil-Dwelling and Ancestrally Wingless Dipluran Campodea augens: A Key Reference Hexapod for Studying the Emergence of Insect Innovations

Tue, 03 Dec 2019 00:00:00 GMT

Abstract
The dipluran two-pronged bristletail Campodea augens is a blind ancestrally wingless hexapod with the remarkable capacity to regenerate lost body appendages such as its long antennae. As sister group to Insecta (sensu stricto), Diplura are key to understanding the early evolution of hexapods and the origin and evolution of insects. Here we report the 1.2-Gb draft genome of C. augens and results from comparative genomic analyses with other arthropods. In C. augens, we uncovered the largest chemosensory gene repertoire of ionotropic receptors in the animal kingdom, a massive expansion that might compensate for the loss of vision. We found a paucity of photoreceptor genes mirroring at the genomic level the secondary loss of an ancestral external photoreceptor organ. Expansions of detoxification and carbohydrate metabolism gene families might reflect adaptations for foraging behavior, and duplicated apoptotic genes might underlie its high regenerative potential. The C. augens genome represents one of the key references for studying the emergence of genomic innovations in insects, the most diverse animal group, and opens up novel opportunities to study the under-explored biology of diplurans.

The Mitogenome of Norway Spruce and a Reappraisal of Mitochondrial Recombination in Plants

Wed, 27 Nov 2019 00:00:00 GMT

Abstract
Plant mitogenomes can be difficult to assemble because they are structurally dynamic and prone to intergenomic DNA transfers, leading to the unusual situation where an organelle genome is far outnumbered by its nuclear counterparts. As a result, comparative mitogenome studies are in their infancy and some key aspects of genome evolution are still known mainly from pregenomic, qualitative methods. To help address these limitations, we combined machine learning and in silico enrichment of mitochondrial-like long reads to assemble the bacterial-sized mitogenome of Norway spruce (Pinaceae: Picea abies). We conducted comparative analyses of repeat abundance, intergenomic transfers, substitution and rearrangement rates, and estimated repeat-by-repeat homologous recombination rates. Prompted by our discovery of highly recombinogenic small repeats in P. abies, we assessed the genomic support for the prevailing hypothesis that intramolecular recombination is predominantly driven by repeat length, with larger repeats facilitating DNA exchange more readily. Overall, we found mixed support for this view: Recombination dynamics were heterogeneous across vascular plants and highly active small repeats (ca. 200 bp) were present in about one-third of studied mitogenomes. As in previous studies, we did not observe any robust relationships among commonly studied genome attributes, but we identify variation in recombination rates as a underinvestigated source of plant mitogenome diversity.

Genome Assembly of the Dogface Butterfly Zerene cesonia

Fri, 22 Nov 2019 00:00:00 GMT

Abstract
Comparisons of high-quality, reference butterfly, and moth genomes have been instrumental to advancing our understanding of how hybridization, and natural selection drive genomic change during the origin of new species and novel traits. Here, we present a genome assembly of the Southern Dogface butterfly, Zerene cesonia (Pieridae) whose brilliant wing colorations have been implicated in developmental plasticity, hybridization, sexual selection, and speciation. We assembled 266,407,278 bp of the Z. cesonia genome, which accounts for 98.3% of the estimated 271 Mb genome size. Using a hybrid approach involving Chicago libraries with Hi-Rise assembly and a diploid Meraculous assembly, the final haploid genome was assembled. In the final assembly, nearly all autosomes and the Z chromosome were assembled into single scaffolds. The largest 29 scaffolds accounted for 91.4% of the genome assembly, with the remaining ∼8% distributed among another 247 scaffolds and overall N50 of 9.2 Mb. Tissue-specific RNA-seq informed annotations identified 16,442 protein-coding genes, which included 93.2% of the arthropod Benchmarking Universal Single-Copy Orthologs (BUSCO). The Z. cesonia genome assembly had ∼9% identified as repetitive elements, with a transposable element landscape rich in helitrons. Similar to other Lepidoptera genomes, Z. cesonia showed a high conservation of chromosomal synteny. The Z. cesonia assembly provides a high-quality reference for studies of chromosomal arrangements in the Pierid family, as well as for population, phylo, and functional genomic studies of adaptation and speciation.

Evolutionary Basis of High-Frequency Hearing in the Cochleae of Echolocators Revealed by Comparative Genomics

Fri, 15 Nov 2019 00:00:00 GMT

Abstract
High-frequency hearing is important for the survival of both echolocating bats and whales, but our understanding of its genetic basis is scattered and segmented. In this study, we combined RNA-Seq and comparative genomic analyses to obtain insights into the comprehensive gene expression profile of the cochlea and the adaptive evolution of hearing-related genes. A total of 144 genes were found to have been under positive selection in various species of echolocating bats and toothed whales, 34 of which were identified to be related to hearing behavior or auditory processes. Subsequently, multiple physiological processes associated with those genes were found to have adaptively evolved in echolocating bats and toothed whales, including cochlear bony development, antioxidant activity, ion balance, and homeostatic processes, along with signal transduction. In addition, abundant convergent/parallel genes and sites were detected between different pairs of echolocator species; however, no specific hearing-related physiological pathways were enriched by them and almost all of the convergent/parallel signals were selectively neutral, as previously reported. Notably, two adaptive parallel evolved sites in TECPR2 were shown to have been under positive selection, indicating their functional importance for the evolution of echolocation and high-frequency hearing in laryngeal echolocating bats. This study deepens our understanding of the genetic bases underlying high-frequency hearing in the cochlea of echolocating bats and toothed whales.

Comparative Genomic Analysis of the Pheromone Receptor Class 1 Family (V1R) Reveals Extreme Complexity in Mouse Lemurs (Genus, Microcebus) and a Chromosomal Hotspot across Mammals

Thu, 14 Nov 2019 00:00:00 GMT

Abstract
Sensory gene families are of special interest for both what they can tell us about molecular evolution and what they imply as mediators of social communication. The vomeronasal type-1 receptors (V1Rs) have often been hypothesized as playing a fundamental role in driving or maintaining species boundaries given their likely function as mediators of intraspecific mate choice, particularly in nocturnal mammals. Here, we employ a comparative genomic approach for revealing patterns of V1R evolution within primates, with a special focus on the small-bodied nocturnal mouse and dwarf lemurs of Madagascar (genera Microcebus and Cheirogaleus, respectively). By doubling the existing genomic resources for strepsirrhine primates (i.e. the lemurs and lorises), we find that the highly speciose and morphologically cryptic mouse lemurs have experienced an elaborate proliferation of V1Rs that we argue is functionally related to their capacity for rapid lineage diversification. Contrary to a previous study that found equivalent degrees of V1R diversity in diurnal and nocturnal lemurs, our study finds a strong correlation between nocturnality and V1R elaboration, with nocturnal lemurs showing elaborate V1R repertoires and diurnal lemurs showing less diverse repertoires. Recognized subfamilies among V1Rs show unique signatures of diversifying positive selection, as might be expected if they have each evolved to respond to specific stimuli. Furthermore, a detailed syntenic comparison of mouse lemurs with mouse (genus Mus) and other mammalian outgroups shows that orthologous mammalian subfamilies, predicted to be of ancient origin, tend to cluster in a densely populated region across syntenic chromosomes that we refer to as a V1R “hotspot.”

The Mitochondrial Genome of Eleusine indica and Characterization of Gene Content within Poaceae

Wed, 23 Oct 2019 00:00:00 GMT

Abstract
Plant mitochondrial (mt) genome assembly provides baseline data on size, structure, and gene content, but resolving the sequence of these large and complex organelle genomes remains challenging due to fragmentation, frequent recombination, and transfers of DNA from neighboring plastids. The mt genome for Eleusine indica (Poaceae: goosegrass) is comprehensibly analyzed here, providing key reference data for an economically significant invasive species that is also the maternal parent of the allotetraploid crop Finger millet (Eleusine coracana). The assembled E. indica genome contains 33 protein coding genes, 6 rRNA subunits, 24 tRNA, 8 large repetitive regions 15 kb of transposable elements across a total of 520,691 bp. Evidence of RNA editing and loss of rpl2, rpl5, rps14, rps11, sdh4, and sdh3 genes is evaluated in the context of an updated survey of mt genomic gene content across the grasses through an analysis of publicly available data. Hypothesized patterns of Poaceae mt gene loss are examined in a phylogenetic context to clarify timing, showing that rpl2 was transferred to the nucleus from the mitochondrion prior to the origin of the PACMAD clade.

Impact of Mutation Rate and Selection at Linked Sites on DNA Variation across the Genomes of Humans and Other Homininae

Wed, 09 Oct 2019 00:00:00 GMT

Abstract
DNA diversity varies across the genome of many species. Variation in diversity across a genome might arise from regional variation in the mutation rate, variation in the intensity and mode of natural selection, and regional variation in the recombination rate. We show that both noncoding and nonsynonymous diversity are positively correlated to a measure of the mutation rate and the recombination rate and negatively correlated to the density of conserved sequences in 50 kb windows across the genomes of humans and nonhuman homininae. Interestingly, we find that although noncoding diversity is equally affected by these three genomic variables, nonsynonymous diversity is mostly dominated by the density of conserved sequences. The positive correlation between diversity and our measure of the mutation rate seems to be largely a direct consequence of regions with higher mutation rates having more diversity. However, the positive correlation with recombination rate and the negative correlation with the density of conserved sequences suggest that selection at linked sites also affect levels of diversity. This is supported by the observation that the ratio of the number of nonsynonymous to noncoding polymorphisms is negatively correlated to a measure of the effective population size across the genome. We show these patterns persist even when we restrict our analysis to GC-conservative mutations, demonstrating that the patterns are not driven by GC biased gene conversion. In conclusion, our comparative analyses describe how recombination rate, gene density, and mutation rate interact to produce the patterns of DNA diversity that we observe along the hominine genomes.

Transcriptome-Wide Patterns of the Genetic and Expression Variations in Two Sympatric Schizothoracine Fishes in a Tibetan Plateau Glacier Lake

Wed, 09 Jan 2019 00:00:00 GMT

Abstract
Sympatric speciation remains a central focus of evolutionary biology. Although some evidence shows speciation occurring in this way, little is known about the gene expression evolution and the characteristics of population genetics as species diverge. Two closely related Gymnocypris fish (Gymnocypris chui and Gymnocypris scleracanthus), which come from a small glacier lake in the Tibetan Plateau, Lake Langcuo, exist a possible incipient sympatric adaptive ecological speciation. We generated large amounts of RNA-Seq data from multiple individuals and tissues from each of the two species and compared gene expression patterns and genetic polymorphisms between them. Ordination analysis separated samples by organ rather than by species. The degree of expression difference between organs within and between species was different. Phylogenetic analyses indicated that the two closely related taxa formed a monophyletic complex. Population structure analysis displayed two distinctly divergent clusters of G. chui and G. scleracanthus populations. By contrast, G. scleracanthus population genetic diversity is higher than that of G. chui. Considerable sites of the two populations were differentiated with a coefficient of FST = 0.25–0.50, implying that a small proportion of loci nevertheless exhibited deep divergence in two comparisons. Concomitantly, putatively selected genes during speciation revealed functional categories are enriched in bone morphogenesis, cell growth, neurogenetics, enzyme activity, and binding activity in G. chui population. In contrast, nutrition and localization were highlighted in G. scleracanthus. Collectively, morphological traits and dietary preference combine with genetic variation and expression variation, probably contributed to the incipient speciation of two sympatric populations.