Repeated DNA of the human Y chromosome

Development ◽  
1987 ◽  
Vol 101 (Supplement) ◽  
pp. 77-92
Author(s):  
Kirby D. Smith ◽  
Keith E. Young ◽  
C. Conover Talbot ◽  
Barbara J. Schmeckpeper

A significant fraction of the human Y chromosome is composed of DNA sequences which have homologues on the X chromosome or autosomes in humans and non-human primates. However, most human Ychromosome sequences so far examined do not have homologues on the Y chromosomes of other primates. This observation suggests that a significant proportion of the human Y chromosome is composed of sequences that have acquired their Y-chromosome association since humans diverged from other primates. More than 50 % of the human Y chromosome is composed of a variety of repeated DNAs which, with one known exception, can be distinguished from homologues elsewhere in the genome. These include the alphoid repeats, the major human SINE (Alu repeats) and several additional families of repeats which account for the majority of Y-chromosome repeated DNA. The alphoid sequences tandemly clustered near the centromere on the Y chromosome can be distinguished from those on other chromosomes by both sequence and repeat organization, while the majority of Y-chromosome Alu repeats have little homology with genomic consensus Alu sequences. In contrast, the Y-chromosome LINE repeats cannot be distinguished from LINEs found on other chromosomes. It has been proposed that both SINE and LINE repeats have been dispersed throughout the genome by mechanisms that involve RNA intermediates. The difference in the relationship of the Y-chromosome Alu and LINE repeats to their respective family members elsewhere in the genome makes it possible that their dispersal to the Y chromosome has occurred by different mechanisms or at different rates. In addition to the SINE and LINE repeats, the human Y chromosome contains a group of repeated DNA elements originally identified as 3·4 and 2·1 kb fragments in HaeIII digests of male genomic DNA. Although the 3·4 and 2·1 kb Y repeats do not crossreact, both exist as tandem clusters of alternating Yspecific and non-Y-specific sequences. The 3·4 kb Y repeats contain at least three distinct sequences with autosomal homologies interspersed in various ways with a collection of several different Yspecific repeat sequences. Individual recombinant clones derived from isolated 3·4 kb HaeIII Y fragments have been identified which do not cross-react. Thus, the 3·4 kb HaeIII Y fragments are a heterogeneous mixture of sequences which have in common the regular occurrence of HaeIII restriction sites at 3·4 kb intervals and an organization as tandem clusters at various sites along the Y-long arm. The 2·1 kb HaeIII Y fragment cross-reacts with a 1i9 kb HaeIII autosomal fragment. Both the Ychromosomal and autosomal fragments are part of tandem clusters which have a unit length of 2·4 kb. All of the 2·4 kb Y repeats are similar and contain a 1·6 kb Y-specific repeat and an 800 bp sequence which has homology with an 800 bp sequence in the autosomal 2·4 kb repeats. While this 800 bp sequence is common to both Y and autosomal 2·4 kb repeats and is associated with a single Y-specific repeat, it is associated with at least four non-cross-reacting autosome-specific sequences. Like the Y repeat, the autosomal repeats exist as tandem clusters of 2·4 kb units and are composed of an 800 bp common sequence alternating with a 1·6 kb autosome-specific sequence. Thus, in humans, the common sequence is associated with several different sequences yet always occurs as part of a tandem cluster of 2·4 kb repeats. The common and autosome-specific sequences of the 2·4 kb repeats are also present in gorillas as part of organized repeat units. However, in gorillas the two are not associated with each other. The Y-chromosome repeats described here are a heterogeneous mixture of sequences organized into specific sets of alternating Y-specific and non-Y-specific sequences. They do not have an identified function and the mechanisms by which they are generated are unknown. Nevertheless, their marked chromosomal speciticity and the regularity of the basic repeat unit in each type of repeat seem inconsistent with stochastic mechanisms of sequence diffusion between chromosomes.

Genomics ◽  
1989 ◽  
Vol 5 (1) ◽  
pp. 153-156 ◽  
Author(s):  
Ulrich Müller ◽  
Marc Lalande ◽  
Timothy A. Donlon ◽  
Michael W. Heartlein

2020 ◽  
Vol 160 (5) ◽  
pp. 264-271
Author(s):  
Juana Gutierrez ◽  
Gael Aleix-Mata ◽  
Juan A. Marchal ◽  
María Arroyo ◽  
Riccardo Castiglia ◽  
...  

The Talpidae family has a highly stable karyotype. Most of the chromosome studies in this mammal group, however, employed classical cytogenetic techniques. Molecular cytogenetic analyses are still scarce and, for example, no repeated DNA sequences have been described to date. In this work, we used sequence analysis, chromosomal mapping of a LINE1 retroelement sequence, as well as chromosome painting with a whole Y chromosome probe of T. occidentalis to compare the karyotypes of 3 species of the genus Talpa (T. occidentalis, T. romana, and T. aquitania). Our results demonstrate that in Talpa genomes LINE1 sequences are widely distributed on all chromosomes but are enriched in pericentromeric C-band-positive regions. In addition, these LINE1 accumulate on the Y chromosomes of the 3 Talpa species regardless of their euchromatic or heterochromatic condition. Chromosome painting shows that the Y chromosomes in these 3 species are highly conserved. Interestingly, they share sequences with heterochromatic blocks on chromosome pairs 14 and 16 and, to a lesser degree, with the pericentromeric regions of other autosomes. Together, our analyses demonstrate that the repetitive DNA content of chromosomes from Talpa species is highly conserved.


This paper reviews past and present trends in mapping the human Y chromosome. So far, mapping has essentially used a combination of cytogenetic and molecular analyses of Y-chromosomal anomalies and sex reversal syndromes. This deletion mapping culminated recently in the isolation of the putative sex-determining locus TDF . With the availability of new separation and cloning techniques suited for large size fragments (over 100 kilobases), the next step will consist rather in the establishment of a physical map of fragments of known physical sizes. This may allow the definition of several variants of the human Y chromosome differing by the order or location of DNA sequences along the molecule.


1989 ◽  
Vol 9 (3) ◽  
pp. 1173-1182
Author(s):  
K Lowenhaupt ◽  
A Rich ◽  
M L Pardue

Long stretches of (dC-dA)n.(dT-dG)n, abbreviated CA/TG, have a distinctive distribution on Drosophila chromosomes (M.L. Pardue, K. Lowenhaupt, A. Rich, and A. Nordheim, EMBO J. 6:1781-1789, 1987). The distribution of CA/TG suggests a correlation with the overall transcriptional activity of chromosomal regions and with the ability to undergo meiotic recombination. These correlations are conserved among Drosophila species and may indicate one or more chromosomal functions. To test the generality of these findings, we analyzed the distribution of the rest of the six possible mono- and dinucleotide repeats (A/T, C/G, AT/AT, CA/TG, CT/AG, and CG/CG). All but CG/CG were present at significant levels in the genomes of the six Drosophila species studied; however, A/T levels were an order of magnitude lower than those of the other sequences. Data base analyses suggested that the same sequences are present in other eucaryotes. Like CA/TG, both CT/AG and C/G showed increased levels on dosage-compensating chromosomes; however, the individual sites clearly differed for each sequence. In contrast, A/T and AT/AT, although present in Drosophila DNA, could not be detected in situ in polytene chromosomes. We also used in situ hybridization to analyze the neo-Y chromosome of Drosophila miranda, an ancestral autosome that has become attached to the Y chromosome and is now partially heterochromatic. The neo-Y has acquired repeated DNA sequences; we found that the added sequences are as devoid of mono- and dinucleotide repeats as other heterochromatin. The distribution and function of these sequences are likely to result from both their repetitious nature and base contents.


2021 ◽  
Vol 4 (1) ◽  
Author(s):  
Qiuhui Li ◽  
Shilin Tian ◽  
Bin Yan ◽  
Chi Man Liu ◽  
Tak-Wah Lam ◽  
...  

AbstractPan-genome sequence analysis of human population ancestry is critical for expanding and better defining human genome sequence diversity. However, the amount of genetic variation still missing from current human reference sequences is still unknown. Here, we used 486 deep-sequenced Han Chinese genomes to identify 276 Mbp of DNA sequences that, to our knowledge, are absent in the current human reference. We classified these sequences into individual-specific and common sequences, and propose that the common sequence size is uncapped with a growing population. The 46.646 Mbp common sequences obtained from the 486 individuals improved the accuracy of variant calling and mapping rate when added to the reference genome. We also analyzed the genomic positions of these common sequences and found that they came from genomic regions characterized by high mutation rate and low pathogenicity. Our study authenticates the Chinese pan-genome as representative of DNA sequences specific to the Han Chinese population missing from the GRCh38 reference genome and establishes the newly defined common sequences as candidates to supplement the current human reference.


Sign in / Sign up

Export Citation Format

Share Document