Docling/tests/data/jats/elife-56337.txt
Cesar Berrospi Ramis 428b656793
feat(xml-jats): parse XML JATS documents (#967)
* chore(xml-jats): separate authors and affiliations

In XML PubMed (JATS) backend, convert authors and affiliations as they
are typically rendered on PDFs.

Signed-off-by: Cesar Berrospi Ramis <75900930+ceberam@users.noreply.github.com>

* fix(xml-jats): replace new line character by a space

Instead of removing new line character from text, replace it by a space character.

Signed-off-by: Cesar Berrospi Ramis <75900930+ceberam@users.noreply.github.com>

* feat(xml-jats): improve existing parser and extend features

Partially support lists, respect reading order, parse more sections, support equations, better text formatting.

Signed-off-by: Cesar Berrospi Ramis <75900930+ceberam@users.noreply.github.com>

* chore(xml-jats): rename PubMed objects to JATS

Signed-off-by: Cesar Berrospi Ramis <75900930+ceberam@users.noreply.github.com>

---------

Signed-off-by: Cesar Berrospi Ramis <75900930+ceberam@users.noreply.github.com>
2025-02-17 10:43:31 +01:00

8 lines
179 KiB
Plaintext
Vendored

<!DOCTYPE article
PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD with MathML3 v1.2 20190208//EN" "JATS-archivearticle1-mathml3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" article-type="research-article"><?properties open_access?><front><journal-meta><journal-id journal-id-type="nlm-ta">eLife</journal-id><journal-id journal-id-type="iso-abbrev">Elife</journal-id><journal-id journal-id-type="publisher-id">eLife</journal-id><journal-title-group><journal-title>eLife</journal-title></journal-title-group><issn pub-type="epub">2050-084X</issn><publisher><publisher-name>eLife Sciences Publications, Ltd</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="pmid">32479262</article-id><article-id pub-id-type="pmc">7289599</article-id><article-id pub-id-type="publisher-id">56337</article-id><article-id pub-id-type="doi">10.7554/eLife.56337</article-id><article-categories><subj-group subj-group-type="display-channel"><subject>Research Article</subject></subj-group><subj-group subj-group-type="heading"><subject>Chromosomes and Gene Expression</subject></subj-group><subj-group subj-group-type="heading"><subject>Genetics and Genomics</subject></subj-group></article-categories><title-group><article-title>KRAB-zinc finger protein gene expansion in response to active retrotransposons in the murine lineage</article-title></title-group><contrib-group><contrib id="author-177034" contrib-type="author"><name><surname>Wolf</surname><given-names>Gernot</given-names></name><contrib-id authenticated="true" contrib-id-type="orcid">https://orcid.org/0000-0002-3943-8662</contrib-id><xref ref-type="aff" rid="aff1">1</xref><xref ref-type="fn" rid="con1"/><xref ref-type="fn" rid="conf1"/><xref ref-type="author-notes" rid="pa1">&#x02020;</xref></contrib><contrib id="author-177086" contrib-type="author"><name><surname>de Iaco</surname><given-names>Alberto</given-names></name><xref ref-type="aff" rid="aff2">2</xref><xref ref-type="fn" rid="con2"/><xref ref-type="fn" rid="conf1"/></contrib><contrib id="author-177059" contrib-type="author"><name><surname>Sun</surname><given-names>Ming-An</given-names></name><xref ref-type="aff" rid="aff1">1</xref><xref ref-type="fn" rid="con3"/><xref ref-type="fn" rid="conf1"/></contrib><contrib id="author-163910" contrib-type="author"><name><surname>Bruno</surname><given-names>Melania</given-names></name><contrib-id authenticated="true" contrib-id-type="orcid">http://orcid.org/0000-0002-8401-7744</contrib-id><xref ref-type="aff" rid="aff1">1</xref><xref ref-type="fn" rid="con4"/><xref ref-type="fn" rid="conf1"/></contrib><contrib id="author-177056" contrib-type="author"><name><surname>Tinkham</surname><given-names>Matthew</given-names></name><xref ref-type="aff" rid="aff1">1</xref><xref ref-type="fn" rid="con5"/><xref ref-type="fn" rid="conf1"/></contrib><contrib id="author-177057" contrib-type="author"><name><surname>Hoang</surname><given-names>Don</given-names></name><xref ref-type="aff" rid="aff1">1</xref><xref ref-type="fn" rid="con6"/><xref ref-type="fn" rid="conf1"/></contrib><contrib id="author-177058" contrib-type="author"><name><surname>Mitra</surname><given-names>Apratim</given-names></name><xref ref-type="aff" rid="aff1">1</xref><xref ref-type="fn" rid="con7"/><xref ref-type="fn" rid="conf1"/></contrib><contrib id="author-163912" contrib-type="author"><name><surname>Ralls</surname><given-names>Sherry</given-names></name><xref ref-type="aff" rid="aff1">1</xref><xref ref-type="fn" rid="con8"/><xref ref-type="fn" rid="conf1"/></contrib><contrib id="author-3909" contrib-type="author"><name><surname>Trono</surname><given-names>Didier</given-names></name><contrib-id authenticated="true" contrib-id-type="orcid">http://orcid.org/0000-0002-3383-0401</contrib-id><xref ref-type="aff" rid="aff2">2</xref><xref ref-type="other" rid="fund2"/><xref ref-type="other" rid="fund3"/><xref ref-type="other" rid="fund4"/><xref ref-type="other" rid="fund5"/><xref ref-type="fn" rid="con9"/><xref ref-type="fn" rid="conf1"/></contrib><contrib id="author-45803" contrib-type="author" corresp="yes"><name><surname>Macfarlan</surname><given-names>Todd S</given-names></name><contrib-id authenticated="true" contrib-id-type="orcid">https://orcid.org/0000-0003-2495-9809</contrib-id><email>todd.macfarlan@nih.gov</email><xref ref-type="aff" rid="aff1">1</xref><xref ref-type="other" rid="fund1"/><xref ref-type="fn" rid="con10"/><xref ref-type="fn" rid="conf1"/></contrib><aff id="aff1"><label>1</label><institution>The Eunice Kennedy Shriver National Institute of Child Health and Human Development, The National Institutes of Health</institution><addr-line>Bethesda</addr-line><country>United States</country></aff><aff id="aff2"><label>2</label><institution>School of Life Sciences, &#x000c9;cole Polytechnique F&#x000e9;d&#x000e9;rale de Lausanne (EPFL)</institution><addr-line>Lausanne</addr-line><country>Switzerland</country></aff></contrib-group><contrib-group><contrib contrib-type="editor"><name><surname>Bourc'his</surname><given-names>Deborah</given-names></name><role>Reviewing Editor</role><aff><institution>Institut Curie</institution><country>France</country></aff></contrib><contrib contrib-type="editor"><name><surname>Perry</surname><given-names>George H</given-names></name><role>Senior Editor</role><aff><institution>Pennsylvania State University</institution><country>United States</country></aff></contrib></contrib-group><author-notes><fn fn-type="present-address" id="pa1"><label>&#x02020;</label><p>CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Vienna, Austria.</p></fn></author-notes><pub-date date-type="pub" publication-format="electronic"><day>01</day><month>6</month><year>2020</year></pub-date><pub-date pub-type="collection"><year>2020</year></pub-date><volume>9</volume><elocation-id>e56337</elocation-id><history><date date-type="received" iso-8601-date="2020-02-24"><day>24</day><month>2</month><year>2020</year></date><date date-type="accepted" iso-8601-date="2020-05-31"><day>31</day><month>5</month><year>2020</year></date></history><permissions><ali:free_to_read xmlns:ali="http://www.niso.org/schemas/ali/1.0/"/><license xlink:href="http://creativecommons.org/publicdomain/zero/1.0/"><ali:license_ref xmlns:ali="http://www.niso.org/schemas/ali/1.0/">http://creativecommons.org/publicdomain/zero/1.0/</ali:license_ref><license-p>This is an open-access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the <ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/publicdomain/zero/1.0/">Creative Commons CC0 public domain dedication</ext-link>.</license-p></license></permissions><self-uri content-type="pdf" xlink:href="elife-56337.pdf"/><abstract><p>The Kr&#x000fc;ppel-associated box zinc finger protein (KRAB-ZFP) family diversified in mammals. The majority of human KRAB-ZFPs bind transposable elements (TEs), however, since most TEs are inactive in humans it is unclear whether KRAB-ZFPs emerged to suppress TEs. We demonstrate that many recently emerged murine KRAB-ZFPs also bind to TEs, including the active ETn, IAP, and L1 families. Using a CRISPR/Cas9-based engineering approach, we genetically deleted five large clusters of KRAB-ZFPs and demonstrate that target TEs are de-repressed, unleashing TE-encoded enhancers. Homozygous knockout mice lacking one of two KRAB-ZFP gene clusters on chromosome 2 and chromosome 4 were nonetheless viable. In pedigrees of chromosome 4 cluster KRAB-ZFP mutants, we identified numerous novel ETn insertions with a modest increase in mutants. Our data strongly support the current model that recent waves of retrotransposon activity drove the expansion of KRAB-ZFP genes in mice and that many KRAB-ZFPs play a redundant role restricting TE activity.</p></abstract><kwd-group kwd-group-type="author-keywords"><kwd>KRAB-ZFP</kwd><kwd>transposable elements</kwd><kwd>retrotransposons</kwd><kwd>evolution</kwd><kwd>chromatin</kwd><kwd>development</kwd></kwd-group><kwd-group kwd-group-type="research-organism"><title>Research organism</title><kwd>Mouse</kwd></kwd-group><funding-group><award-group id="fund1"><funding-source><institution-wrap><institution-id institution-id-type="FundRef">http://dx.doi.org/10.13039/100009633</institution-id><institution>Eunice Kennedy Shriver National Institute of Child Health and Human Development</institution></institution-wrap></funding-source><award-id>1ZIAHD008933</award-id><principal-award-recipient><name><surname>Macfarlan</surname><given-names>Todd S</given-names></name></principal-award-recipient></award-group><award-group id="fund2"><funding-source><institution-wrap><institution-id institution-id-type="FundRef">http://dx.doi.org/10.13039/501100001711</institution-id><institution>Swiss National Science Foundation</institution></institution-wrap></funding-source><award-id>310030_152879</award-id><principal-award-recipient><name><surname>Trono</surname><given-names>Didier</given-names></name></principal-award-recipient></award-group><award-group id="fund3"><funding-source><institution-wrap><institution-id institution-id-type="FundRef">http://dx.doi.org/10.13039/501100001711</institution-id><institution>Swiss National Science Foundation</institution></institution-wrap></funding-source><award-id>310030B_173337</award-id><principal-award-recipient><name><surname>Trono</surname><given-names>Didier</given-names></name></principal-award-recipient></award-group><award-group id="fund4"><funding-source><institution-wrap><institution-id institution-id-type="FundRef">http://dx.doi.org/10.13039/501100000781</institution-id><institution>European Research Council</institution></institution-wrap></funding-source><award-id>No. 268721</award-id><principal-award-recipient><name><surname>Trono</surname><given-names>Didier</given-names></name></principal-award-recipient></award-group><award-group id="fund5"><funding-source><institution-wrap><institution-id institution-id-type="FundRef">http://dx.doi.org/10.13039/501100000781</institution-id><institution>European Research Council</institution></institution-wrap></funding-source><award-id>No 694658</award-id><principal-award-recipient><name><surname>Trono</surname><given-names>Didier</given-names></name></principal-award-recipient></award-group><funding-statement>The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.</funding-statement></funding-group><custom-meta-group><custom-meta specific-use="meta-only"><meta-name>Author impact statement</meta-name><meta-value>Unbiased ChIP-seq screens and genetic knockouts of large Kruppel associated box zinc finger protein (KRAB-ZFP) clusters reveal that evolutionarily young KRAB-ZFPs play a redundant role in retrotransposon restriction in mice.</meta-value></custom-meta></custom-meta-group></article-meta></front><body><sec sec-type="intro" id="s1"><title>Introduction</title><p>Nearly half of the human and mouse genomes consist of transposable elements (TEs). TEs shape the evolution of species, serving as a source for genetic innovation (<xref rid="bib6" ref-type="bibr">Chuong et al., 2016</xref>; <xref rid="bib13" ref-type="bibr">Frank and Feschotte, 2017</xref>). However, TEs also potentially harm their hosts by insertional mutagenesis, gene deregulation and activation of innate immunity (<xref rid="bib33" ref-type="bibr">Maksakova et al., 2006</xref>; <xref rid="bib19" ref-type="bibr">Kano et al., 2007</xref>; <xref rid="bib4" ref-type="bibr">Brodziak et al., 2012</xref>; <xref rid="bib16" ref-type="bibr">Hancks and Kazazian, 2016</xref>). To protect themselves from TE activity, host organisms have developed a wide range of defense mechanisms targeting virtually all steps of the TE life cycle (<xref rid="bib10" ref-type="bibr">Dewannieux and Heidmann, 2013</xref>). In tetrapods, KRAB zinc finger protein (KRAB-ZFP) genes have amplified and diversified, likely in response to TE colonization (<xref rid="bib47" ref-type="bibr">Thomas and Schneider, 2011</xref>; <xref rid="bib35" ref-type="bibr">Najafabadi et al., 2015</xref>; <xref rid="bib53" ref-type="bibr">Wolf et al., 2015a</xref>; <xref rid="bib54" ref-type="bibr">Wolf et al., 2015b</xref>; <xref rid="bib17" ref-type="bibr">Imbeault et al., 2017</xref>). Conventional ZFPs bind DNA using tandem arrays of C2H2 zinc finger domains, each capable of specifically interacting with three nucleotides, whereas some zinc fingers can bind two or four nucleotides and include DNA backbone interactions depending on target DNA structure (<xref rid="bib38" ref-type="bibr">Patel et al., 2018</xref>). This allows KRAB-ZFPs to flexibly bind to large stretches of DNA with high affinity. The KRAB domain binds the corepressor KAP1, which in turn recruits histone modifying enzymes including the NuRD histone deacetylase complex and the H3K9-specific methylase SETDB1 (<xref rid="bib44" ref-type="bibr">Schultz et al., 2002</xref>; <xref rid="bib46" ref-type="bibr">Sripathy et al., 2006</xref>), which induces persistent and heritable gene silencing (<xref rid="bib15" ref-type="bibr">Groner et al., 2010</xref>). Deletion of KAP1 (<xref rid="bib41" ref-type="bibr">Rowe et al., 2010</xref>) or SETDB1 (<xref rid="bib34" ref-type="bibr">Matsui et al., 2010</xref>) in mouse embryonic stem (ES) cells induces TE reactivation and cell death, but only minor phenotypes in differentiated cells, suggesting KRAB-ZFPs are most important during early embryogenesis where they mark TEs for stable epigenetic silencing that persists through development. However, SETDB1-containing complexes are also required to repress TEs in primordial germ cells (<xref rid="bib29" ref-type="bibr">Liu et al., 2014</xref>) and adult tissues (<xref rid="bib11" ref-type="bibr">Ecco et al., 2016</xref>), indicating KRAB-ZFPs are active beyond early development.</p><p>TEs, especially long terminal repeat (LTR) retrotransposons, also known as endogenous retroviruses (ERVs), can affect expression of neighboring genes through their promoter and enhancer functions (<xref rid="bib32" ref-type="bibr">Macfarlan et al., 2012</xref>; <xref rid="bib51" ref-type="bibr">Wang et al., 2014</xref>; <xref rid="bib48" ref-type="bibr">Thompson et al., 2016</xref>). KAP1 deletion in mouse ES cells causes rapid gene deregulation (<xref rid="bib42" ref-type="bibr">Rowe et al., 2013</xref>), indicating that KRAB-ZFPs may regulate gene expression by recruiting KAP1 to TEs. Indeed, <italic>Zfp809</italic> knock-out (KO) in mice resulted in transcriptional activation of a handful of genes in various tissues adjacent to ZFP809-targeted VL30-Pro elements (<xref rid="bib54" ref-type="bibr">Wolf et al., 2015b</xref>). It has therefore been speculated that KRAB-ZFPs bind to TE sequences to domesticate them for gene regulatory innovation (<xref rid="bib12" ref-type="bibr">Ecco et al., 2017</xref>). This idea is supported by the observation that many human KRAB-ZFPs target TE groups that have lost their coding potential millions of years ago and that KRAB-ZFP target sequences within TEs are in some cases under purifying selection (<xref rid="bib17" ref-type="bibr">Imbeault et al., 2017</xref>). However, there are also clear signs of an evolutionary arms-race between human TEs and KRAB-ZFPs (<xref rid="bib18" ref-type="bibr">Jacobs et al., 2014</xref>), indicating that some KRAB-ZFPs may limit TE mobility for stretches of evolutionary time, prior to their ultimate loss from the genome or adaptation for other regulatory functions. Here we use the laboratory mouse, which has undergone a recent expansion of the KRAB-ZFP family, to determine the in vivo requirement of the majority of evolutionarily young KRAB-ZFP genes.</p></sec><sec sec-type="results" id="s2"><title>Results</title><sec id="s2-1"><title>Mouse KRAB-ZFPs target retrotransposons</title><p>We analyzed the RNA expression profiles of mouse KRAB-ZFPs across a wide range of tissues to identify candidates active in early embryos/ES cells. While the majority of KRAB-ZFPs are expressed at low levels and uniformly across tissues, a group of KRAB-ZFPs are highly and almost exclusively expressed in ES cells (<xref ref-type="fig" rid="fig1s1">Figure 1&#x02014;figure supplement 1A</xref>). About two thirds of these KRAB-ZFPs are physically linked in two clusters on chromosome 2 (Chr2-cl) and 4 (Chr4-cl) (<xref ref-type="fig" rid="fig1s1">Figure 1&#x02014;figure supplement 1B</xref>). These two clusters encode 40 and 21 KRAB-ZFP annotated genes, respectively, which, with one exception on Chr4-cl, do not have orthologues in rat or any other sequenced mammals (<xref ref-type="supplementary-material" rid="supp1">Supplementary file 1</xref>). The KRAB-ZFPs within these two genomic clusters also group together phylogenetically (<xref ref-type="fig" rid="fig1s1">Figure 1&#x02014;figure supplement 1C</xref>), indicating these gene clusters arose by a series of recent segmental gene duplications (<xref rid="bib21" ref-type="bibr">Kauzlaric et al., 2017</xref>).</p><p>To determine the binding sites of the KRAB-ZFPs within these and other gene clusters, we expressed epitope-tagged KRAB-ZFPs using stably integrating vectors in mouse embryonic carcinoma (EC) or ES cells (<xref rid="table1" ref-type="table">Table 1</xref>, <xref ref-type="supplementary-material" rid="supp1">Supplementary file 1</xref>) and performed chromatin immunoprecipitation followed by deep sequencing (ChIP-seq). We then determined whether the identified binding sites are significantly enriched over annotated TEs and used the non-repetitive peak fraction to identify binding motifs. We discarded 7 of 68 ChIP-seq datasets because we could not obtain a binding motif or a target TE and manual inspection confirmed low signal to noise ratio. Of the remaining 61 KRAB-ZFPs, 51 significantly overlapped at least one TE subfamily (<italic>adjusted p-value</italic>&#x0003c;1e-5). Altogether, 81 LTR retrotransposon, 18 LINE, 10 SINE and one DNA transposon subfamilies were targeted by at least one of the 51 KRAB-ZFPs (<xref ref-type="fig" rid="fig1">Figure 1A</xref> and <xref ref-type="supplementary-material" rid="supp1">Supplementary file 1</xref>). Chr2-cl KRAB-ZFPs preferably bound IAPEz retrotransposons and L1-type LINEs, while Chr4-cl KRAB-ZFPs targeted various retrotransposons, including the closely related MMETn (hereafter referred to as ETn) and ETnERV (also known as MusD) elements (<xref ref-type="fig" rid="fig1">Figure 1A</xref>). ETn elements are non-autonomous LTR retrotransposons that require trans-complementation by the fully coding ETnERV elements that contain Gag, Pro and Pol genes (<xref rid="bib39" ref-type="bibr">Ribet et al., 2004</xref>). These elements have accumulated to&#x000a0;~240 and~100 copies in the reference C57BL/6 genome, respectively, with&#x000a0;~550 solitary LTRs (<xref rid="bib2" ref-type="bibr">Baust et al., 2003</xref>). Both ETn and ETnERVs are still active, generating polymorphisms and mutations in several mouse strains (<xref rid="bib14" ref-type="bibr">Gagnier et al., 2019</xref>). The validity of our ChIP-seq screen was confirmed by the identification of binding motifs - which often resembled the computationally predicted motifs (<xref ref-type="fig" rid="fig1s2">Figure 1&#x02014;figure supplement 2A</xref>) - for the majority of screened KRAB-ZFPs (<xref ref-type="supplementary-material" rid="supp1">Supplementary file 1</xref>). Moreover, predicted and experimentally determined motifs were found in targeted TEs in most cases (<xref ref-type="supplementary-material" rid="supp1">Supplementary file 1</xref>), and reporter repression assays confirmed KRAB-ZFP induced silencing for all the tested sequences (<xref ref-type="fig" rid="fig1s2">Figure 1&#x02014;figure supplement 2B</xref>). Finally, we observed KAP1 and H3K9me3 enrichment at most of the targeted TEs in wild type ES cells, indicating that most of these KRAB-ZFPs are functionally active in the early embryo (<xref ref-type="fig" rid="fig1">Figure 1A</xref>).</p><fig id="fig1" position="float" orientation="portrait"><label>Figure 1.</label><caption><title>Genome-wide binding patterns of mouse KRAB-ZFPs.</title><p>(<bold>A</bold>) Probability heatmap of KRAB-ZFP binding to TEs. Blue color intensity (main field) corresponds to -log10 (<italic>adjusted p-value</italic>) enrichment of ChIP-seq peak overlap with TE groups (Fisher&#x02019;s exact test). The green/red color intensity (top panel) represents mean KAP1 (GEO accession: GSM1406445) and H3K9me3 (GEO accession: GSM1327148) enrichment (respectively) at peaks overlapping significantly targeted TEs (<italic>adjusted p-value</italic>&#x0003c;1e-5) in WT ES cells. (<bold>B</bold>) Summarized ChIP-seq signal for indicated KRAB-ZFPs and previously published KAP1 and H3K9me3 in WT ES cells across 127 intact ETn elements. (<bold>C</bold>) Heatmaps of KRAB-ZFP ChIP-seq signal at ChIP-seq peaks. For better comparison, peaks for all three KRAB-ZFPs were called with the same parameters (p&#x0003c;1e-10, peak enrichment&#x000a0;&#x0003e;20). The top panel shows a schematic of the arrangement of the contact amino acid composition of each zinc finger. Zinc fingers are grouped and colored according to similarity, with amino acid differences relative to the five consensus fingers highlighted in white.</p><p><supplementary-material content-type="local-data" id="fig1sdata1"><label>Figure 1&#x02014;source data 1.</label><caption><title>KRAB-ZFP expression in 40 mouse tissues and cell lines (ENCODE).</title><p>Mean values of replicates are shown as log2 transcripts per million.</p></caption><media mime-subtype="xlsx" mimetype="application" xlink:href="elife-56337-fig1-data1.xlsx" orientation="portrait" id="d36e553" position="anchor"/></supplementary-material></p><p><supplementary-material content-type="local-data" id="fig1sdata2"><label>Figure 1&#x02014;source data 2.</label><caption><title>Probability heatmap of KRAB-ZFP binding to TEs.</title><p>Values corresponds to -log10 (<italic>adjusted p-value</italic>) enrichment of ChIP-seq peak overlap with TE groups (Fisher&#x02019;s exact test).</p></caption><media mime-subtype="xlsx" mimetype="application" xlink:href="elife-56337-fig1-data2.xlsx" orientation="portrait" id="d36e566" position="anchor"/></supplementary-material></p></caption><graphic xlink:href="elife-56337-fig1"/><p content-type="supplemental-figure"><fig id="fig1s1" specific-use="child-fig" orientation="portrait" position="anchor"><label>Figure 1&#x02014;figure supplement 1.</label><caption><title>ES cell-specific expression of KRAB-ZFP gene clusters.</title><p>(<bold>A</bold>) Heatmap showing expression patterns of mouse KRAB-ZFPs in 40 mouse tissues and cell lines (ENCODE). Heatmap colors indicate gene expression levels in log2 transcripts per million (TPM). The asterisk indicates a group of 30 KRAB-ZFPs that are exclusively expressed in ES cells. (<bold>B</bold>) Physical location of the genes encoding for the 30 KRAB-ZFPs that are exclusively expressed in ES cells. (<bold>C</bold>) Phylogenetic (Maximum likelihood) tree of the KRAB domains of mouse KRAB-ZFPs. KRAB-ZFPs encoded on the gene clusters on chromosome 2 and 4 are highlighted. The scale bar at the bottom indicates amino acid substitutions per site.</p></caption><graphic xlink:href="elife-56337-fig1-figsupp1"/></fig></p><p content-type="supplemental-figure"><fig id="fig1s2" specific-use="child-fig" orientation="portrait" position="anchor"><label>Figure 1&#x02014;figure supplement 2.</label><caption><title>KRAB-ZFP binding motifs and their repression activity.</title><p>(<bold>A</bold>) Comparison of computationally predicted (bottom) and experimentally determined (top) KRAB-ZFP binding motifs. Only significant pairs are shown (FDR&#x000a0;&#x0003c;&#x000a0;0.1). (<bold>B</bold>) Luciferase reporter assays to confirm KRAB-ZFP repression of the identified target sites. Bars show the luciferase activity (normalized to Renilla luciferase) of reporter plasmids containing the indicated target sites cloned upstream of the SV40 promoter. Reporter plasmids were co-transfected into 293 T cells with a Renilla luciferase plasmid for normalization and plasmids expressing the targeting KRAB-ZFP. Normalized mean luciferase activity (from three replicates) is shown relative to luciferase activity of the reporter plasmid co-transfected with an empty pcDNA3.1 vector.</p></caption><graphic xlink:href="elife-56337-fig1-figsupp2"/></fig></p><p content-type="supplemental-figure"><fig id="fig1s3" specific-use="child-fig" orientation="portrait" position="anchor"><label>Figure 1&#x02014;figure supplement 3.</label><caption><title>KRAB-ZFP binding to ETn retrotransposons.</title><p>(<bold>A</bold>) Comparison of the PBS<sup>Lys1,2</sup> sequence with Zfp961 binding motifs in nonrepetitive peaks (Nonrep) and peaks at ETn elements. (<bold>B</bold>) Retrotransposition assays of original (ETnI1-neoTNF and MusD2-neoTNF <xref rid="bib39" ref-type="bibr">Ribet et al., 2004</xref>) and modified reporter vectors where the Rex2 or Gm13051 binding motifs where removed. Schematic of reporter vectors are displayed at the top. HeLa cells were transfected as described in the Materials and Methods section and neo-resistant colonies, indicating retrotransposition events, were selected and stained. (<bold>C</bold>) Stem-loop structure of the ETn RNA export signal, the Gm13051 motif on the corresponding DNA is marked with red circles, the part of the motif that was deleted is indicated with grey crosses (adapted from <xref rid="bib25" ref-type="bibr">Legiewicz et al., 2010</xref>).</p></caption><graphic xlink:href="elife-56337-fig1-figsupp3"/></fig></p></fig><table-wrap id="table1" orientation="portrait" position="float"><label>Table 1.</label><caption><title>KRAB-ZFP genes clusters in the mouse genome that were investigated in this study.</title><p>* Number of protein-coding KRAB-ZFP genes identified in a previously published screen (<xref rid="bib17" ref-type="bibr">Imbeault et al., 2017</xref>) and the ChIP-seq data column indicates the number of KRAB-ZFPs for which ChIP-seq was performed in this study.</p></caption><table frame="hsides" rules="groups"><thead><tr><th valign="bottom" rowspan="1" colspan="1">Cluster</th><th valign="bottom" rowspan="1" colspan="1">Location</th><th valign="bottom" rowspan="1" colspan="1">Size (Mb)</th><th valign="bottom" rowspan="1" colspan="1"># of KRAB-ZFPs*</th><th valign="bottom" rowspan="1" colspan="1">ChIP-seq data</th></tr></thead><tbody><tr><td valign="bottom" rowspan="1" colspan="1">Chr2</td><td valign="bottom" rowspan="1" colspan="1">Chr2 qH4</td><td valign="bottom" rowspan="1" colspan="1">3.1</td><td valign="bottom" rowspan="1" colspan="1">40</td><td valign="bottom" rowspan="1" colspan="1">17</td></tr><tr><td valign="bottom" rowspan="1" colspan="1">Chr4</td><td valign="bottom" rowspan="1" colspan="1">Chr4 qE1</td><td valign="bottom" rowspan="1" colspan="1">2.3</td><td valign="bottom" rowspan="1" colspan="1">21</td><td valign="bottom" rowspan="1" colspan="1">19</td></tr><tr><td valign="bottom" rowspan="1" colspan="1">Chr10</td><td valign="bottom" rowspan="1" colspan="1">Chr10 qC1</td><td valign="bottom" rowspan="1" colspan="1">0.6</td><td valign="bottom" rowspan="1" colspan="1">6</td><td valign="bottom" rowspan="1" colspan="1">1</td></tr><tr><td valign="bottom" rowspan="1" colspan="1">Chr13.1</td><td valign="bottom" rowspan="1" colspan="1">Chr13 qB3</td><td valign="bottom" rowspan="1" colspan="1">1.2</td><td valign="bottom" rowspan="1" colspan="1">6</td><td valign="bottom" rowspan="1" colspan="1">2</td></tr><tr><td valign="bottom" rowspan="1" colspan="1">Chr13.2</td><td valign="bottom" rowspan="1" colspan="1">Chr13 qB3</td><td valign="bottom" rowspan="1" colspan="1">0.8</td><td valign="bottom" rowspan="1" colspan="1">26</td><td valign="bottom" rowspan="1" colspan="1">12</td></tr><tr><td valign="bottom" rowspan="1" colspan="1">Chr8</td><td valign="bottom" rowspan="1" colspan="1">Chr8 qB3.3</td><td valign="bottom" rowspan="1" colspan="1">0.1</td><td valign="bottom" rowspan="1" colspan="1">4</td><td valign="bottom" rowspan="1" colspan="1">4</td></tr><tr><td valign="bottom" rowspan="1" colspan="1">Chr9</td><td valign="bottom" rowspan="1" colspan="1">Chr9 qA3</td><td valign="bottom" rowspan="1" colspan="1">0.1</td><td valign="bottom" rowspan="1" colspan="1">4</td><td valign="bottom" rowspan="1" colspan="1">2</td></tr><tr><td valign="bottom" rowspan="1" colspan="1">Other</td><td valign="bottom" rowspan="1" colspan="1">-</td><td valign="bottom" rowspan="1" colspan="1">-</td><td valign="bottom" rowspan="1" colspan="1">248</td><td valign="bottom" rowspan="1" colspan="1">4</td></tr></tbody></table></table-wrap><p>We generally observed that KRAB-ZFPs present exclusively in mouse target TEs that are restricted to the mouse genome, indicating KRAB-ZFPs and their targets emerged together. For example, several mouse-specific KRAB-ZFPs in Chr2-cl and Chr4-cl target IAP and ETn elements which are only found in the mouse genome and are highly active. This is the strongest data to date supporting that recent KRAB-ZFP expansions in these young clusters is a response to recent TE activity. Likewise, ZFP599 and ZFP617, both conserved in Muroidea, bind to various ORR1-type LTRs which are present in the rat genome (<xref ref-type="supplementary-material" rid="supp1">Supplementary file 1</xref>). However, ZFP961, a KRAB-ZFP encoded on a small gene cluster on chromosome 8 that is conserved in Muroidea targets TEs that are only found in the mouse genome (e.g. ETn), a paradox we have previously observed with ZFP809, which also targets TEs that are evolutionarily younger than itself (<xref rid="bib54" ref-type="bibr">Wolf et al., 2015b</xref>). The ZFP961 binding site is located at the 5&#x02019; end of the internal region of ETn and ETnERV elements, a sequence that usually contains the primer binding site (PBS), which is required to prime retroviral reverse transcription. Indeed, the ZFP961 motif closely resembles the PBS<sup>Lys1,2</sup> (<xref ref-type="fig" rid="fig1s3">Figure 1&#x02014;figure supplement 3A</xref>), which had been previously identified as a KAP1-dependent target of retroviral repression (<xref rid="bib55" ref-type="bibr">Yamauchi et al., 1995</xref>; <xref rid="bib52" ref-type="bibr">Wolf et al., 2008</xref>). Repression of the PBS<sup>Lys1,2</sup> by ZFP961 was also confirmed in reporter assays (<xref ref-type="fig" rid="fig1s2">Figure 1&#x02014;figure supplement 2B</xref>), indicating that ZFP961 is likely responsible for this silencing effect.</p><p>To further test the hypothesis that KRAB-ZFPs target sites necessary for retrotransposition, we utilized previously generated ETn and ETnERV retrotransposition reporters in which we mutated KRAB-ZFP binding sites (<xref rid="bib39" ref-type="bibr">Ribet et al., 2004</xref>). Whereas the ETnERV reporters are sufficient for retrotransposition, the ETn reporter requires ETnERV genes supplied in trans. We tested and confirmed that the REX2/ZFP600 and GM13051 binding sites within these TEs are required for efficient retrotransposition (<xref ref-type="fig" rid="fig1s3">Figure 1&#x02014;figure supplement 3B</xref>). REX2 and ZFP600 both bind a target about 200 bp from the start of the internal region (<xref ref-type="fig" rid="fig1">Figure 1B</xref>), a region that often encodes the packaging signal. GM13051 binds a target coding for part of a highly structured mRNA export signal (<xref rid="bib25" ref-type="bibr">Legiewicz et al., 2010</xref>) near the 3&#x02019; end of the internal region of ETn (<xref ref-type="fig" rid="fig1s3">Figure 1&#x02014;figure supplement 3C</xref>). Both signals are characterized by stem-loop intramolecular base-pairing in which a single mutation can disrupt loop formation. This indicates that at least some KRAB-ZFPs evolved to bind functionally essential target sequences which cannot easily evade repression by mutation.</p><p>Our KRAB-ZFP ChIP-seq dataset also provided unique insights into the emergence of new KRAB-ZFPs and binding patterns. The Chr4-cl KRAB-ZFPs REX2 and ZFP600 bind to the same target within ETn but with varying affinity (<xref ref-type="fig" rid="fig1">Figure 1C</xref>). Comparison of the amino acids responsible for DNA contact revealed a high similarity between REX2 and ZFP600, with the main differences at the most C-terminal zinc fingers. Additionally, we found that GM30910, another KRAB-ZFP encoded in the Chr4-cl, also shows a strong similarity to both KRAB-ZFPs yet targets entirely different groups of TEs (<xref ref-type="fig" rid="fig1">Figure 1C</xref> and <xref ref-type="supplementary-material" rid="supp1">Supplementary file 1</xref>). Together with previously shown data (<xref rid="bib11" ref-type="bibr">Ecco et al., 2016</xref>), this example highlights how addition of a few new zinc fingers to an existing array can entirely shift the mode of DNA binding.</p></sec><sec id="s2-2"><title>Genetic deletion of KRAB-ZFP gene clusters leads to retrotransposon reactivation</title><p>The majority of KRAB-ZFP genes are harbored in large, highly repetitive clusters that have formed by successive complex segmental duplications (<xref rid="bib21" ref-type="bibr">Kauzlaric et al., 2017</xref>), rendering them inaccessible to conventional gene targeting. We therefore developed a strategy to delete entire KRAB-ZFP gene clusters in ES cells (including the Chr2-cl and Chr4-cl as well as two clusters on chromosome 13 and a cluster on chromosome 10) using two CRISPR/Cas9 gRNAs targeting unique regions flanking each cluster, and short single-stranded repair oligos with homologies to both sides of the projected cut sites. Using this approach, we generated five cluster KO ES cell lines in at least two biological replicates and performed RNA sequencing (RNA-seq) to determine TE expression levels. Strikingly, four of the five cluster KO ES cells exhibited distinct TE reactivation phenotypes (<xref ref-type="fig" rid="fig2">Figure 2A</xref>). Chr2-cl KO resulted in reactivation of several L1 subfamilies as well as RLTR10 (up to more than 100-fold as compared to WT) and IAPEz ERVs. In contrast, the most strongly upregulated TEs in Chr4-cl KO cells were ETn/ETnERV (up to 10-fold as compared to WT), with several other ERV groups modestly reactivated. ETn/ETnERV elements were also upregulated in Chr13.2-cl KO ES cells while the only upregulated ERVs in Chr13.1-cl KO ES cells were MMERVK10C elements (<xref ref-type="fig" rid="fig2">Figure 2A</xref>). Most reactivated retrotransposons were targeted by at least one KRAB-ZFP that was encoded in the deleted cluster (<xref ref-type="fig" rid="fig2">Figure 2A</xref> and <xref ref-type="supplementary-material" rid="supp1">Supplementary file 1</xref>), indicating a direct effect of these KRAB-ZFPs on TE expression levels. Furthermore, we observed a loss of KAP1 binding and H3K9me3 at several TE subfamilies that are targeted by at least one KRAB-ZFP within the deleted Chr2-cl and Chr4-cl (<xref ref-type="fig" rid="fig2">Figure 2B</xref>, <xref ref-type="fig" rid="fig2s1">Figure 2&#x02014;figure supplement 1A</xref>), including L1, ETn and IAPEz elements. Using reduced representation bisulfite sequencing (RRBS-seq), we found that a subset of KRAB-ZFP bound TEs were partially hypomethylated in Chr4-cl KO ES cells, but only when grown in genome-wide hypomethylation-inducing conditions (<xref rid="bib3" ref-type="bibr">Blaschke et al., 2013</xref>; <xref ref-type="fig" rid="fig2">Figure 2C</xref> and <xref ref-type="supplementary-material" rid="supp2">Supplementary file 2</xref>). These data are consistent with the hypothesis that KRAB-ZFPs/KAP1 are not required to establish DNA methylation, but under certain conditions they protect specific TEs and imprint control regions from genome-wide demethylation (<xref rid="bib27" ref-type="bibr">Leung et al., 2014</xref>; <xref rid="bib9" ref-type="bibr">Deniz et al., 2018</xref>).</p><fig id="fig2" position="float" orientation="portrait"><label>Figure 2.</label><caption><title>Retrotransposon reactivation in KRAB-ZFP cluster KO ES cells.</title><p>(<bold>A</bold>) RNA-seq analysis of TE expression in five KRAB-ZFP cluster KO ES cells. Green and grey squares on top of the panel represent KRAB-ZFPs with or without ChIP-seq data, respectively, within each deleted gene cluster. Reactivated TEs that are bound by one or several KRAB-ZFPs are indicated by green squares in the panel. Significantly up- and downregulated elements (<italic>adjusted p-value</italic>&#x0003c;0.05) are highlighted in red and green, respectively. (<bold>B</bold>) Differential KAP1 binding and H3K9me3 enrichment at TE groups (summarized across all insertions) in Chr2-cl and Chr4-cl KO ES cells. TE groups targeted by one or several KRAB-ZFPs encoded within the deleted clusters are highlighted in blue (differential enrichment over the entire TE sequences) and red (differential enrichment at TE regions that overlap with KRAB-ZFP ChIP-seq peaks). (<bold>C</bold>) DNA methylation status of CpG sites at indicated TE groups in WT and Chr4-cl KO ES cells grown in serum containing media or in hypomethylation-inducing media (2i + Vitamin C). P-values were calculated using paired t-test.</p><p><supplementary-material content-type="local-data" id="fig2sdata1"><label>Figure 2&#x02014;source data 1.</label><caption><title>Differential H3K9me3 and KAP1 distribution in WT and KRAB-ZFP cluster KO ES cells at TE families and KRAB-ZFP bound TE insertions.</title><p>Differential read counts and statistical testing were determined by DESeq2.</p></caption><media mime-subtype="xlsx" mimetype="application" xlink:href="elife-56337-fig2-data1.xlsx" orientation="portrait" id="d36e873" position="anchor"/></supplementary-material></p></caption><graphic xlink:href="elife-56337-fig2"/><p content-type="supplemental-figure"><fig id="fig2s1" specific-use="child-fig" orientation="portrait" position="anchor"><label>Figure 2&#x02014;figure supplement 1.</label><caption><title>Epigenetic changes at TEs and TE-borne enhancers in KRAB-ZFP cluster KO ES cells.</title><p>(<bold>A</bold>) Differential analysis of summative (all individual insertions combined) H3K9me3 enrichment at TE groups in Chr10-cl, Chr13.1-cl and Chr13.2-cl KO ES cells.&#x000a0;TE groups targeted by one or several KRAB-ZFPs encoded within the deleted clusters are highlighted in orange (differential enrichment over the entire TE sequences) and red (differential enrichment at TE regions that overlap with KRAB-ZFP ChIP-seq peaks). (<bold>B</bold>) Top: Schematic view of the <italic>Cd59a</italic>/<italic>Cd59b</italic> locus with a 5&#x02019; truncated ETn insertion. ChIP-seq (Input subtracted from ChIP) data for overexpressed epitope-tagged Gm13051 (a Chr4-cl KRAB-ZFP) in F9 EC cells, and re-mapped KAP1 (GEO accession: GSM1406445) and H3K9me3 (GEO accession: GSM1327148) in WT ES cells are shown together with RNA-seq data from Chr4-cl WT and KO ES cells (mapped using Bowtie (-a -m 1 --<monospace>strata</monospace> -v 2) to exclude reads that cannot be uniquely mapped). Bottom: Transcriptional activity of a 5 kb fragment with or without fragments of the ETn insertion was tested by luciferase reporter assay in Chr4-cl WT and KO ES cells.</p></caption><graphic xlink:href="elife-56337-fig2-figsupp1"/></fig></p></fig></sec><sec id="s2-3"><title>KRAB-ZFP cluster deletions license TE-borne enhancers</title><p>We next used our RNA-seq datasets to determine the effect of KRAB-ZFP cluster deletions on gene expression. We identified 195 significantly upregulated and 130 downregulated genes in Chr4-cl KO ES cells, and 108 upregulated and 59 downregulated genes in Chr2-cl KO ES cells (excluding genes on the deleted cluster) (<xref ref-type="fig" rid="fig3">Figure 3A</xref>). To address whether gene deregulation in Chr2-cl and Chr4-cl KO ES cells is caused by nearby TE reactivation, we determined whether genes near certain TE subfamilies are more frequently deregulated than random genes. We found a strong correlation of gene upregulation and TE proximity for several TE subfamilies, of which many became transcriptionally activated themselves (<xref ref-type="fig" rid="fig3">Figure 3B</xref>). For example, nearly 10% of genes that are located within 100 kb (up- or downstream of the TSS) of an ETn element are upregulated in Chr4-cl KO ES cells, as compared to 0.8% of all genes. In Chr2-cl KO ES cells, upregulated genes were significantly enriched near various LINE groups but also IAPEz-int and RLTR10-int elements, indicating that TE-binding KRAB-ZFPs in these clusters limit the potential activating effects of TEs on nearby genes.</p><fig id="fig3" orientation="portrait" position="float"><label>Figure 3.</label><caption><title>TE-dependent gene activation in KRAB-ZFP cluster KO ES cells.</title><p>(<bold>A</bold>) Differential gene expression in Chr2-cl and Chr4-cl KO ES cells. Significantly up- and downregulated genes (<italic>adjusted p-value</italic>&#x0003c;0.05) are highlighted in red and green, respectively, KRAB-ZFP genes within the deleted clusters are shown in blue. (<bold>B</bold>) Correlation of TEs and gene deregulation. Plots show enrichment of TE groups within 100 kb of up- and downregulated genes relative to all genes. Significantly overrepresented LTR and LINE groups (<italic>adjusted p-value</italic>&#x0003c;0.1) are highlighted in blue and red, respectively. (<bold>C</bold>) Schematic view of the downstream region of <italic>Chst1</italic> where a 5&#x02019; truncated ETn insertion is located. ChIP-seq (Input subtracted from ChIP) data for overexpressed epitope-tagged Gm13051 (a Chr4-cl KRAB-ZFP) in F9 EC cells, and re-mapped KAP1 (GEO accession: GSM1406445) and H3K9me3 (GEO accession: GSM1327148) in WT ES cells are shown together with RNA-seq data from Chr4-cl WT and KO ES cells (mapped using Bowtie (-a -m 1 --<monospace>strata</monospace> -v 2) to exclude reads that cannot be uniquely mapped). (<bold>D</bold>) RT-qPCR analysis of Chst1 mRNA expression in Chr4-cl WT and KO ES cells with or without the CRISPR/Cas9 deleted ETn insertion near <italic>Chst1</italic>. Values represent mean expression (normalized to Gapdh) from three biological replicates per sample (each performed in three technical replicates) in arbitrary units. Error bars represent standard deviation and asterisks indicate significance (p&#x0003c;0.01, Student&#x02019;s t-test). n.s.: not significant. (<bold>E</bold>) Mean coverage of ChIP-seq data (Input subtracted from ChIP) in Chr4-cl WT and KO ES cells over 127 full-length ETn insertions. The binding sites of the Chr4-cl KRAB-ZFPs Rex2 and Gm13051 are indicated by dashed lines.</p></caption><graphic xlink:href="elife-56337-fig3"/></fig><p>While we generally observed that TE-associated gene reactivation is not caused by elongated or spliced transcription starting at the retrotransposons, we did observe that the strength of the effect of ETn elements on gene expression is stronger on genes in closer proximity. About 25% of genes located within 20 kb of an ETn element, but only 5% of genes located at a distance between 50 and 100 kb from the nearest ETn insertion, become upregulated in Chr4-cl KO ES cells. Importantly however, the correlation is still significant for genes that are located at distances between 50 and 100 kb from the nearest ETn insertion, indicating that ETn elements can act as long-range enhancers of gene expression in the absence of KRAB-ZFPs that target them. To confirm that Chr4-cl KRAB-ZFPs such as GM13051 block ETn-borne enhancers, we tested the ability of a putative ETn enhancer to activate transcription in a reporter assay. For this purpose, we cloned a 5 kb fragment spanning from the GM13051 binding site within the internal region of a truncated ETn insertion to the first exon of the <italic>Cd59a</italic> gene, which is strongly activated in Chr4-cl KO ES cells (<xref ref-type="fig" rid="fig2s1">Figure 2&#x02014;figure supplement 1B</xref>). We observed strong transcriptional activity of this fragment which was significantly higher in Chr4-cl KO ES cells. Surprisingly, this activity was reduced to background when the internal segment of the ETn element was not included in the fragment, suggesting the internal segment of the ETn element, but not its LTR, contains a Chr4-cl KRAB-ZFP sensitive enhancer. To further corroborate these findings, we genetically deleted an ETn element that is located about 60 kb from the TSS of <italic>Chst1</italic>, one of the top-upregulated genes in Chr4-cl KO ES cells (<xref ref-type="fig" rid="fig3">Figure 3C</xref>). RT-qPCR analysis revealed that the <italic>Chst1</italic> upregulation phenotype in Chr4-cl KO ES cells diminishes when the ETn insertion is absent, providing direct evidence that a KRAB-ZFP controlled ETn-borne enhancer regulates <italic>Chst1</italic> expression (<xref ref-type="fig" rid="fig3">Figure 3D</xref>). Furthermore, ChIP-seq confirmed a general increase of H3K4me3, H3K4me1 and H3K27ac marks at ETn elements in Chr4-cl KO ES cells (<xref ref-type="fig" rid="fig3">Figure 3E</xref>). Notably, enhancer marks were most pronounced around the GM13051 binding site near the 3&#x02019; end of the internal region, confirming that the enhancer activity of ETn is located on the internal region and not on the LTR.</p></sec><sec id="s2-4"><title>ETn retrotransposition in Chr4-cl KO and WT mice</title><p>IAP, ETn/ETnERV and MuLV/RLTR4 retrotransposons are highly polymorphic in inbred mouse strains (<xref rid="bib36" ref-type="bibr">Nell&#x000e5;ker et al., 2012</xref>), indicating that these elements are able to mobilize in the germ line. Since these retrotransposons are upregulated in Chr2-cl and Chr4-cl KO ES cells, we speculated that these KRAB-ZFP clusters evolved to minimize the risks of insertional mutagenesis by retrotransposition. To test this, we generated Chr2-cl and Chr4-cl KO mice via ES cell injection into blastocysts, and after germ line transmission we genotyped the offspring of heterozygous breeding pairs. While the offspring of Chr4-cl KO/WT parents were born close to Mendelian ratios in pure C57BL/6 and mixed C57BL/6 129Sv matings, one Chr4-cl KO/WT breeding pair gave birth to significantly fewer KO mice than expected (p-value=0.022) (<xref ref-type="fig" rid="fig4s1">Figure 4&#x02014;figure supplement 1A</xref>). Likewise, two out of four Chr2-cl KO breeding pairs on mixed C57BL/6 129Sv matings failed to give birth to a single KO offspring (p-value&#x0003c;0.01) while the two other mating pairs produced KO offspring at near Mendelian ratios (<xref ref-type="fig" rid="fig4s1">Figure 4&#x02014;figure supplement 1A</xref>). Altogether, these data indicate that KRAB-ZFP clusters are not absolutely essential in mice, but that genetic and/or epigenetic factors may contribute to reduced viability.</p><p>We reasoned that retrotransposon activation could account for the reduced viability of Chr2-cl and Chr4-cl KO mice in some matings. However, since only rare matings produced non-viable KO embryos, we instead turned to the viable KO mice to assay for increased transposon activity. RNA-seq in blood, brain and testis revealed that, with a few exceptions, retrotransposons upregulated in Chr2 and Chr4 KRAB-ZFP cluster KO ES cells are not expressed at higher levels in adult tissues (<xref ref-type="fig" rid="fig4s1">Figure 4&#x02014;figure supplement 1B</xref>). Likewise, no strong transcriptional TE reactivation phenotype was observed in liver and kidney of Chr4-cl KO mice (data not shown) and ChIP-seq with antibodies against H3K4me1, H3K4me3 and H3K27ac in testis of Chr4-cl WT and KO mice revealed no increase of active histone marks at ETn elements or other TEs (data not shown). This indicates that Chr2-cl and Chr4-cl KRAB-ZFPs are primarily required for TE repression during early development. This is consistent with the high expression of these KRAB-ZFPs uniquely in ES cells (<xref ref-type="fig" rid="fig1s1">Figure 1&#x02014;figure supplement 1A</xref>). To determine whether retrotransposition occurs at a higher frequency in Chr4-cl KO mice during development, we screened for novel ETn (ETn/ETnERV) and MuLV (MuLV/RLTR4_MM) insertions in viable Chr4-cl KO mice. For this purpose, we developed a capture-sequencing approach to enrich for ETn/MuLV DNA and flanking sequences from genomic DNA using probes that hybridize with the 5&#x02019; and 3&#x02019; ends of ETn and MuLV LTRs prior to deep sequencing. We screened genomic DNA samples from a total of 76 mice, including 54 mice from ancestry-controlled Chr4-cl KO matings in various strain backgrounds, the two ES cell lines the Chr4-cl KO mice were generated from, and eight mice from a Chr2-cl KO mating which served as a control (since ETn and MuLVs are not activated in Chr2-cl KO ES cells) (<xref ref-type="supplementary-material" rid="supp4">Supplementary file 4</xref>). Using this approach, we were able to enrich reads mapping to ETn/MuLV LTRs about 2,000-fold compared to genome sequencing without capture. ETn/MuLV insertions were determined by counting uniquely mapped reads that were paired with reads mapping to ETn/MuLV elements (see materials and methods for details). To assess the efficiency of the capture approach, we determined what proportion of a set of 309 largely intact (two LTRs flanking an internal sequence) reference ETn elements could be identified using our sequencing data. 95% of these insertions were called with high confidence in the majority of our samples (data not shown), indicating that we are able to identify ETn insertions at a high recovery rate.</p><p>Using this dataset, we first confirmed the polymorphic nature of both ETn and MuLV retrotransposons in laboratory mouse strains (<xref ref-type="fig" rid="fig4s2">Figure 4&#x02014;figure supplement 2A</xref>), highlighting the potential of these elements to retrotranspose. To identify novel insertions, we filtered out insertions that were supported by ETn/MuLV-paired reads in more than one animal. While none of the 54 ancestry-controlled mice showed a single novel MuLV insertion, we observed greatly varying numbers of up to 80 novel ETn insertions in our pedigree (<xref ref-type="fig" rid="fig4">Figure 4A</xref>).</p><fig id="fig4" position="float" orientation="portrait"><label>Figure 4.</label><caption><title>ETn retrotransposition in Chr4-cl KO mice.</title><p>(<bold>A</bold>) Pedigree of mice used for transposon insertion screening by capture-seq in mice of different strain backgrounds. The number of novel ETn insertions (only present in one animal) are indicated. For animals whose direct ancestors have not been screened, the ETn insertions are shown in parentheses since parental inheritance cannot be excluded in that case. Germ line insertions are indicated by asterisks. All DNA samples were prepared from tail tissues unless noted (-S: spleen, -E: ear, -B:Blood) (<bold>B</bold>) Statistical analysis of ETn insertion frequency in tail tissue from 30 Chr4-cl KO, KO/WT and WT mice that were derived from one Chr4-c KO x KO/WT and two Chr4-cl KO/WT x KO/WT matings. Only DNA samples that were collected from juvenile tails were considered for this analysis. P-values were calculated using one-sided Wilcoxon Rank Sum Test. In the last panel, KO, WT and KO/WT mice derived from all matings were combined for the statistical analysis.</p><p><supplementary-material content-type="local-data" id="fig4sdata1"><label>Figure 4&#x02014;source data 1.</label><caption><title>Coordinates of identified novel ETn insertions and supporting capture-seq read counts.</title><p>Genomic regions indicate cluster of supporting reads.</p></caption><media mime-subtype="xlsx" mimetype="application" xlink:href="elife-56337-fig4-data1.xlsx" orientation="portrait" id="d36e1034" position="anchor"/></supplementary-material></p><p><supplementary-material content-type="local-data" id="fig4sdata2"><label>Figure 4&#x02014;source data 2.</label><caption><title>Sequences of capture-seq probes used to enrich genomic DNA for ETn and MuLV (RLTR4) insertions.</title></caption><media mime-subtype="plain" mimetype="text" xlink:href="elife-56337-fig4-data2.txt" orientation="portrait" id="d36e1042" position="anchor"/></supplementary-material></p></caption><graphic xlink:href="elife-56337-fig4"/><p content-type="supplemental-figure"><fig id="fig4s1" specific-use="child-fig" orientation="portrait" position="anchor"><label>Figure 4&#x02014;figure supplement 1.</label><caption><title>Birth statistics of KRAB-ZFP cluster KO mice and TE reactivation in adult tissues.</title><p>(<bold>A</bold>) Birth statistics of Chr4- and Chr2-cl mice derived from KO/WT x KO/WT matings in different strain backgrounds.&#x000a0;(<bold>B</bold>) RNA-seq analysis of TE expression in Chr2- (left) and Chr4-cl (right) KO tissues. TE groups with the highest reactivation phenotype in ES cells are shown separately. Significantly up- and downregulated elements (<italic>adjusted p-value</italic>&#x0003c;0.05) are highlighted in red and green, respectively. Experiments were performed in at least two biological replicates.</p></caption><graphic xlink:href="elife-56337-fig4-figsupp1"/></fig></p><p content-type="supplemental-figure"><fig id="fig4s2" specific-use="child-fig" orientation="portrait" position="anchor"><label>Figure 4&#x02014;figure supplement 2.</label><caption><title>Identification of polymorphic ETn and MuLV retrotransposon insertions in Chr4-cl KO and WT mice.</title><p>Heatmaps show normalized capture-seq read counts in RPM (Read Per Million) for identified polymorphic ETn (<bold>A</bold>) and MuLV (<bold>B</bold>) loci in different mouse strains. Only loci with strong support for germ line ETn or MuLV insertions (at least 100 or 3000 ETn or MuLV RPM, respectively) in at least two animals are shown. Non-polymorphic insertion loci with high read counts in all screened mice were excluded for better visibility. The sample information (sample name and cell type/tissue) is annotated at the bottom, with the strain information indicated by color at the top. The color gradient indicates log10(RPM+1).</p></caption><graphic xlink:href="elife-56337-fig4-figsupp2"/></fig></p><p content-type="supplemental-figure"><fig id="fig4s3" specific-use="child-fig" orientation="portrait" position="anchor"><label>Figure 4&#x02014;figure supplement 3.</label><caption><title>Confirmation of novel ETn insertions identified by capture-seq.</title><p>(<bold>A</bold>) PCR validation of novel ETn insertions in genomic DNA of three littermates (IDs: T09673, T09674 and T00436) and their parents (T3913 and T3921). Primer sequences are shown in <xref ref-type="supplementary-material" rid="supp3">Supplementary file 3</xref>. (<bold>B</bold>) ETn capture-seq read counts (RPM) at putative novel somatic (loci identified exclusively in one single animal), novel germ line (loci identified in several littermates) insertions, and at B6 reference ETn elements. (<bold>C</bold>) Heatmap shows capture-seq read counts (RPM) of a Chr4-cl KO mouse (ID: C6733) as determined in different tissues. Each row represents a novel ETn locus that was identified in at least one tissue. The color gradient indicates log10(RPM+1). (<bold>D</bold>) Heatmap shows the capture-seq RPM in technical replicates using the same Chr4-cl KO DNA sample (rep1/rep2) or replicates with DNA samples prepared from different sections of the tail from the same mouse at different ages (tail1/tail2). Each row represents a novel ETn locus that was identified in at least one of the displayed samples. The color gradient indicates log10(RPM+1).</p></caption><graphic xlink:href="elife-56337-fig4-figsupp3"/></fig></p></fig><p>To validate some of the novel ETn insertions, we designed specific PCR primers for five of the insertions and screened genomic DNA of the mice in which they were identified as well as their parents. For all tested insertions, we were able to amplify their flanking sequence and show that these insertions are absent in their parents (<xref ref-type="fig" rid="fig4s3">Figure 4&#x02014;figure supplement 3A</xref>). To confirm their identity, we amplified and sequenced three of the novel full-length ETn insertions. Two of these elements (Genbank accession: MH449667-68) resembled typical ETnII elements with identical 5&#x02019; and 3&#x02019; LTRs and target site duplications (TSD) of 4 or 6 bp, respectively. The third sequenced element (MH449669) represented a hybrid element that contains both ETnI and MusD (ETnERV) sequences. Similar insertions can be found in the B6 reference genome; however, the identified novel insertion has a 2.5 kb deletion of the 5&#x02019; end of the internal region. Additionally, the 5&#x02019; and 3&#x02019; LTR of this element differ in one nucleotide near the start site and contain an unusually large 248 bp TSD (containing a SINE repeat) indicating that an improper integration process might have truncated this element.</p><p>Besides novel ETn insertions that were only identified in one specific animal, we also observed three ETn insertions that could be detected in several siblings but not in their parents or any of the other screened mice. This strongly indicates that these retrotransposition events occurred in the germ line of the parents from which they were passed on to some of their offspring. One of these germ line insertions was evidently passed on from the offspring to the next generation (<xref ref-type="fig" rid="fig4">Figure 4A</xref>). As expected, the read numbers supporting these novel germ line insertions were comparable to the read numbers that were found in the flanking regions of annotated B6 ETn insertions (<xref ref-type="fig" rid="fig4s3">Figure 4&#x02014;figure supplement 3B</xref>). In contrast, virtually all novel insertions that were only found in one animal were supported by significantly fewer reads (<xref ref-type="fig" rid="fig4s3">Figure 4&#x02014;figure supplement 3B</xref>). This indicates that these elements resulted from retrotransposition events in the developing embryo and not in the zygote or parental germ cells. Indeed, we detected different sets of insertions in various tissues from the same animal (<xref ref-type="fig" rid="fig4s3">Figure 4&#x02014;figure supplement 3C</xref>). Even between tail samples that were collected from the same animal at different ages, only a fraction of the new insertions were present in both samples, while technical replicates from the same genomic DNA samples showed a nearly complete overlap in insertions (<xref ref-type="fig" rid="fig4s3">Figure 4&#x02014;figure supplement 3D</xref>).</p><p>Finally, we asked whether there were more novel ETn insertions in mice lacking the Chr4-cl relative to their wild type and heterozygous littermates in our pedigree. Interestingly, only one out of the eight Chr4-cl KO mice in a pure C57BL/6 strain background and none of the eight offspring from a Chr2-cl mating carried a single novel ETn insertion (<xref ref-type="fig" rid="fig4">Figure 4A</xref>). When crossing into a 129Sv background for a single generation before intercrossing heterozygous mice (F1), we observed 4 out of 8 Chr4-cl KO mice that contained at least one new ETn insertion, whereas none of 3 heterozygous mice contained any insertions. After crossing to the 129Sv background for a second generation (F2), we determined the number of novel ETn insertions in the offspring of one KO/WT x KO and two KO/WT x KO/WT matings, excluding all samples that were not derived from juvenile tail tissue. Only in the offspring of the KO/WT x KO mating, we observed a statistically significant higher average number of ETn insertions in KO vs. KO/WT animals (7.3 vs. 29.6, p=0.045, <xref ref-type="fig" rid="fig4">Figure 4B</xref>). Other than that, only a non-significant trend towards greater average numbers of ETn insertions in KO (11 vs. 27.8, p=0.192, <xref ref-type="fig" rid="fig4">Figure 4B</xref>) was apparent in one of the WT/KO x KO/WT matings whereas no difference in ETn insertion numbers between WT and KO mice could be observed in the second mating WT/KO x KO/WT (26 vs. 31, p=0.668, <xref ref-type="fig" rid="fig4">Figure 4B</xref>). When comparing all KO with all WT and WT/KO mice from these three matings, a trend towards more ETn insertions in KO remained but was not supported by strong significance (26 vs. 13, p=0.057, <xref ref-type="fig" rid="fig4">Figure 4B</xref>). Altogether, we observed a high variability in the number of new ETn insertions in both KO and WT but our data suggest that the Chr4-cl KRAB-ZFPs may have a modest effect on ETn retrotransposition rates in some mouse strains but other genetic and epigenetic effects clearly also play an important role.</p></sec></sec><sec sec-type="discussion" id="s3"><title>Discussion</title><p>C2H2 zinc finger proteins, about half of which contain a KRAB repressor domain, represent the largest DNA-binding protein family in mammals. Nevertheless, most of these factors have not been investigated using loss-of-function studies. The most comprehensive characterization of human KRAB-ZFPs revealed a strong preference to bind TEs (<xref rid="bib17" ref-type="bibr">Imbeault et al., 2017</xref>;&#x000a0;<xref rid="bib35" ref-type="bibr">Najafabadi et al., 2015</xref>) yet their function remains unknown. In humans, very few TEs are capable of retrotransposition yet many of them, often tens of million years old, are bound by KRAB-ZFPs. While this suggests that human KRAB-ZFPs mainly serve to control TE-borne enhancers and may have potentially transcription-independent functions, we were interested in the biological significance of KRAB-ZFPs in restricting potentially active TEs. The mouse is an ideal model for such studies since the mouse genome contains several active TE families, including IAP, ETn and L1 elements. We found that many of the young KRAB-ZFPs present in the genomic clusters of KRAB-ZFPs on chromosomes 2 and 4, which are highly expressed in a restricted pattern in ES cells, bound redundantly to these three active TE families. In several cases, KRAB-ZFPs bound to functionally constrained sequence elements we and others have demonstrated to be necessary for retrotransposition, including PBS and viral packaging signals. Targeting such sequences may help the host defense system keep pace with rapidly evolving mouse transposons. This provides strong evidence that many young KRAB-ZFPs are indeed expanding in response to TE activity. But do these young KRAB-ZFP genes limit the mobilization of TEs? Despite the large number of polymorphic ETn elements in mouse strains (<xref rid="bib36" ref-type="bibr">Nell&#x000e5;ker et al., 2012</xref>) and several reports of phenotype-causing novel ETn germ line insertions, no new ETn insertions were reported in recent screens of C57BL/6 mouse genomes (<xref rid="bib40" ref-type="bibr">Richardson et al., 2017</xref>; <xref rid="bib14" ref-type="bibr">Gagnier et al., 2019</xref>), indicating that the overall rate of ETn germ line mobilization in inbred mice is rather low. We have demonstrated that Chr4-cl KRAB-ZFPs control ETn/ETnERV expression in ES cells, but this does not lead to widespread ETn mobility in viable C57BL/6 mice. In contrast, we found numerous novel, including several germ line, ETn insertions in both WT and Chr4-cl KO mice in a C57BL/6 129Sv mixed genetic background, with generally more insertions in KO mice and in mice with more 129Sv DNA. This is consistent with a report detecting ETn insertions in FVB.129 mice (<xref rid="bib43" ref-type="bibr">Schauer et al., 2018</xref>). Notably, there was a large variation in the number of new insertions in these mice, possibly caused by hyperactive polymorphic ETn insertions that varied from individual to individual, epigenetic variation at ETn insertions between individuals and/or the general stochastic nature of ETn mobilization. Furthermore, recent reports have suggested that KRAB-ZFP gene content is distinct in different strains of laboratory mice (<xref rid="bib28" ref-type="bibr">Lilue et al., 2018</xref>; <xref rid="bib49" ref-type="bibr">Treger et al., 2019</xref>), and reduced KRAB-ZFP gene content could contribute to increased activity in individual mice. Although we have yet to find obvious phenotypes in the mice carrying new insertions, novel ETn germ line insertions have been shown to cause phenotypes from short tails (<xref rid="bib31" ref-type="bibr">Lugani et al., 2013</xref>; <xref rid="bib45" ref-type="bibr">Semba et al., 2013</xref>; <xref rid="bib50" ref-type="bibr">Vlangos et al., 2013</xref>) to limb malformation (<xref rid="bib19" ref-type="bibr">Kano et al., 2007</xref>) and severe morphogenetic defects including polypodia (<xref rid="bib26" ref-type="bibr">Lehoczky et al., 2013</xref>) depending upon their insertion site.</p><p>Despite a lack of widespread ETn activation in Chr4-cl KO mice, it still remains to be determined whether other TEs, like L1, IAP or other LTR retrotransposons are activated in any of the KRAB-ZFP cluster KO mice, which will require the development of additional capture-seq based assays. Notably, two of the heterozygous matings from Chr2-cl KO mice failed to produce viable knockout offspring, which could indicate a TE-reactivation phenotype. It may also be necessary to generate compound homozygous mutants of distinct KRAB-ZFP clusters to eliminate redundancy before TEs become unleashed. The KRAB-ZFP cluster knockouts produced here will be useful reagents to test such hypotheses. In sum, our data supports that a major driver of KRAB-ZFP gene expansion in mice is recent retrotransposon insertions, and that redundancy within the KRAB-ZFP gene family and with other TE restriction pathways provides protection against widespread TE mobility, explaining the non-essential function of the majority of KRAB-ZFP genes.</p></sec><sec sec-type="materials|methods" id="s4"><title>Materials and methods</title><table-wrap id="keyresource" orientation="portrait" position="anchor"><label>Key resources table</label><table frame="hsides" rules="groups"><thead><tr><th valign="top" rowspan="1" colspan="1">Reagent type <break/>(species) or <break/>resource</th><th valign="top" rowspan="1" colspan="1">Designation</th><th valign="top" rowspan="1" colspan="1">Source or <break/>reference</th><th valign="top" rowspan="1" colspan="1">Identifiers</th><th valign="top" rowspan="1" colspan="1">Additional <break/>information</th></tr></thead><tbody><tr><td valign="top" rowspan="1" colspan="1">Strain, strain background (<italic>Mus musculus</italic>)</td><td valign="top" rowspan="1" colspan="1">129 &#x000d7; 1/SvJ</td><td valign="top" rowspan="1" colspan="1">The Jackson Laboratory</td><td valign="top" rowspan="1" colspan="1">000691</td><td valign="top" rowspan="1" colspan="1">Mice used to generate mixed strain Chr4-cl KO mice</td></tr><tr><td valign="top" rowspan="1" colspan="1">Cell line (<italic>Homo-sapiens</italic>)</td><td valign="top" rowspan="1" colspan="1">HeLa</td><td valign="top" rowspan="1" colspan="1">ATCC</td><td valign="top" rowspan="1" colspan="1">ATCC CCL-2</td><td valign="top" rowspan="1" colspan="1"/></tr><tr><td valign="top" rowspan="1" colspan="1">Cell line (<italic>Mus musculus</italic>)</td><td valign="top" rowspan="1" colspan="1">JM8A3.N1 C57BL/6N-Atm1Brd</td><td valign="top" rowspan="1" colspan="1">KOMP Repository</td><td valign="top" rowspan="1" colspan="1">PL236745</td><td valign="top" rowspan="1" colspan="1">B6 ES cells used to generate KO cell lines and mice</td></tr><tr><td valign="top" rowspan="1" colspan="1">Cell line (<italic>Mus musculus</italic>)</td><td valign="top" rowspan="1" colspan="1">B6;129&#x02010; Gt(ROSA)26Sortm1(cre/ERT)Nat/J</td><td valign="top" rowspan="1" colspan="1">The Jackson Laboratory</td><td valign="top" rowspan="1" colspan="1">004847</td><td valign="top" rowspan="1" colspan="1">ES cells used to generate KO cell lines and mice</td></tr><tr><td valign="top" rowspan="1" colspan="1">Cell line (<italic>Mus musculus</italic>)</td><td valign="top" rowspan="1" colspan="1">R1 ES cells</td><td valign="top" rowspan="1" colspan="1">Andras Nagy lab</td><td valign="top" rowspan="1" colspan="1">R1</td><td valign="top" rowspan="1" colspan="1">129 ES cells used to generate KO cell lines and mice</td></tr><tr><td valign="top" rowspan="1" colspan="1">Cell line (<italic>Mus musculus</italic>)</td><td valign="top" rowspan="1" colspan="1">F9 Embryonic carcinoma cells</td><td valign="top" rowspan="1" colspan="1">ATCC</td><td valign="top" rowspan="1" colspan="1">ATCC CRL-1720</td><td valign="top" rowspan="1" colspan="1"/></tr><tr><td valign="top" rowspan="1" colspan="1">Antibody</td><td valign="top" rowspan="1" colspan="1">Mouse monoclonal ANTI-FLAG M2 antibody</td><td valign="top" rowspan="1" colspan="1">Sigma-Aldrich</td><td valign="top" rowspan="1" colspan="1">Cat# F1804, RRID:<ext-link ext-link-type="uri" xlink:href="https://scicrunch.org/resolver/AB_262044">AB_262044</ext-link></td><td valign="top" rowspan="1" colspan="1">ChIP (1 &#x000b5;g/10<sup>7</sup> cells)</td></tr><tr><td valign="top" rowspan="1" colspan="1">Antibody</td><td valign="top" rowspan="1" colspan="1">Rabbit polyclonal anti-HA</td><td valign="top" rowspan="1" colspan="1">Abcam</td><td valign="top" rowspan="1" colspan="1">Cat# ab9110, RRID:<ext-link ext-link-type="uri" xlink:href="https://scicrunch.org/resolver/AB_307019">AB_307019</ext-link></td><td valign="top" rowspan="1" colspan="1">ChIP (1 &#x000b5;g/10<sup>7</sup> cells)</td></tr><tr><td valign="top" rowspan="1" colspan="1">Antibody</td><td valign="top" rowspan="1" colspan="1">Mouse monoclonal anti-HA</td><td valign="top" rowspan="1" colspan="1">Covance</td><td valign="top" rowspan="1" colspan="1">Cat# MMS-101P-200, RRID:<ext-link ext-link-type="uri" xlink:href="https://scicrunch.org/resolver/AB_10064068">AB_10064068</ext-link></td><td valign="top" rowspan="1" colspan="1"/></tr><tr><td valign="top" rowspan="1" colspan="1">Antibody</td><td valign="top" rowspan="1" colspan="1">Rabbit polyclonal anti-H3K9me3</td><td valign="top" rowspan="1" colspan="1">Active Motif</td><td valign="top" rowspan="1" colspan="1">Cat# 39161, RRID:<ext-link ext-link-type="uri" xlink:href="https://scicrunch.org/resolver/AB_2532132">AB_2532132</ext-link></td><td valign="top" rowspan="1" colspan="1">ChIP (3 &#x000b5;l/10<sup>7</sup> cells)</td></tr><tr><td valign="top" rowspan="1" colspan="1">Antibody</td><td valign="top" rowspan="1" colspan="1">Rabbit polyclonal anti-GFP</td><td valign="top" rowspan="1" colspan="1">Thermo Fisher Scientific</td><td valign="top" rowspan="1" colspan="1">Cat# A-11122, RRID:<ext-link ext-link-type="uri" xlink:href="https://scicrunch.org/resolver/AB_221569">AB_221569</ext-link></td><td valign="top" rowspan="1" colspan="1">ChIP (1 &#x000b5;g/10<sup>7</sup> cells)</td></tr><tr><td valign="top" rowspan="1" colspan="1">Antibody</td><td valign="top" rowspan="1" colspan="1">Rabbit polyclonal anti- H3K4me3</td><td valign="top" rowspan="1" colspan="1">Abcam</td><td valign="top" rowspan="1" colspan="1">Cat# ab8580, RRID:<ext-link ext-link-type="uri" xlink:href="https://scicrunch.org/resolver/AB_306649">AB_306649</ext-link></td><td valign="top" rowspan="1" colspan="1">ChIP (1 &#x000b5;g/10<sup>7</sup> cells)</td></tr><tr><td valign="top" rowspan="1" colspan="1">Antibody</td><td valign="top" rowspan="1" colspan="1">Rabbit polyclonal anti- H3K4me1</td><td valign="top" rowspan="1" colspan="1">Abcam</td><td valign="top" rowspan="1" colspan="1">Cat# ab8895, RRID:<ext-link ext-link-type="uri" xlink:href="https://scicrunch.org/resolver/AB_306847">AB_306847</ext-link></td><td valign="top" rowspan="1" colspan="1">ChIP (1 &#x000b5;g/10<sup>7</sup> cells)</td></tr><tr><td valign="top" rowspan="1" colspan="1">Antibody</td><td valign="top" rowspan="1" colspan="1">Rabbit polyclonal anti- H3K27ac</td><td valign="top" rowspan="1" colspan="1">Abcam</td><td valign="top" rowspan="1" colspan="1">Cat# ab4729, RRID:<ext-link ext-link-type="uri" xlink:href="https://scicrunch.org/resolver/AB_2118291">AB_2118291</ext-link></td><td valign="top" rowspan="1" colspan="1">ChIP (1 &#x000b5;g/10<sup>7</sup> cells)</td></tr><tr><td valign="top" rowspan="1" colspan="1">Recombinant DNA reagent</td><td valign="top" rowspan="1" colspan="1">pCW57.1</td><td valign="top" rowspan="1" colspan="1">Addgene</td><td valign="top" rowspan="1" colspan="1">RRID:<ext-link ext-link-type="uri" xlink:href="https://scicrunch.org/resolver/Addgene_41393">Addgene_41393</ext-link></td><td valign="top" rowspan="1" colspan="1">Inducible lentiviral expression vector</td></tr><tr><td valign="top" rowspan="1" colspan="1">Recombinant DNA reagent</td><td valign="top" rowspan="1" colspan="1">pX330-U6-Chimeric_BB-CBh-hSpCas9</td><td valign="top" rowspan="1" colspan="1">Addgene</td><td valign="top" rowspan="1" colspan="1">RRID:<ext-link ext-link-type="uri" xlink:href="https://scicrunch.org/resolver/Addgene_42230">Addgene_42230</ext-link></td><td valign="top" rowspan="1" colspan="1">CRISPR/Cas9 expression construct</td></tr><tr><td valign="top" rowspan="1" colspan="1">Sequence-based reagent</td><td valign="top" rowspan="1" colspan="1">Chr2-cl KO gRNA.1</td><td valign="top" rowspan="1" colspan="1">This paper</td><td valign="top" rowspan="1" colspan="1">Cas9 gRNA</td><td valign="top" rowspan="1" colspan="1"><named-content content-type="sequence">GCCGTTGCTCAGTCCAAATG</named-content></td></tr><tr><td valign="top" rowspan="1" colspan="1">Sequenced-based reagent</td><td valign="top" rowspan="1" colspan="1">Chr2-cl KO gRNA.2</td><td valign="top" rowspan="1" colspan="1">This paper</td><td valign="top" rowspan="1" colspan="1">Cas9 gRNA</td><td valign="top" rowspan="1" colspan="1"><named-content content-type="sequence">GATACCAGAGGTGGCCGCAAG</named-content></td></tr><tr><td valign="top" rowspan="1" colspan="1">Sequenced-based reagent</td><td valign="top" rowspan="1" colspan="1">Chr4-cl KO gRNA.1</td><td valign="top" rowspan="1" colspan="1">This paper</td><td valign="top" rowspan="1" colspan="1">Cas9 gRNA</td><td valign="top" rowspan="1" colspan="1"><named-content content-type="sequence">GCAAAGGGGCTCCTCGATGGA</named-content></td></tr><tr><td valign="top" rowspan="1" colspan="1">Sequence-based reagent</td><td valign="top" rowspan="1" colspan="1">Chr4-cl KO gRNA.2</td><td valign="top" rowspan="1" colspan="1">This paper</td><td valign="top" rowspan="1" colspan="1">Cas9 gRNA</td><td valign="top" rowspan="1" colspan="1"><named-content content-type="sequence">GTTTATGGCCGTGCTAAGGTC</named-content></td></tr><tr><td valign="top" rowspan="1" colspan="1">Sequenced-based reagent</td><td valign="top" rowspan="1" colspan="1">Chr10-cl KO gRNA.1</td><td valign="top" rowspan="1" colspan="1">This paper</td><td valign="top" rowspan="1" colspan="1">Cas9 gRNA</td><td valign="top" rowspan="1" colspan="1"><named-content content-type="sequence">GTTGCCTTCATCCCACCGTG</named-content></td></tr><tr><td valign="top" rowspan="1" colspan="1">Sequenced-based reagent</td><td valign="top" rowspan="1" colspan="1">Chr10-cl KO gRNA.2</td><td valign="top" rowspan="1" colspan="1">This paper</td><td valign="top" rowspan="1" colspan="1">Cas9 gRNA</td><td valign="top" rowspan="1" colspan="1"><named-content content-type="sequence">GAAGTTCGACTTGGACGGGCT</named-content></td></tr><tr><td valign="top" rowspan="1" colspan="1">Sequenced-based reagent</td><td valign="top" rowspan="1" colspan="1">Chr13.1-cl KO gRNA.1</td><td valign="top" rowspan="1" colspan="1">This paper</td><td valign="top" rowspan="1" colspan="1">Cas9 gRNA</td><td valign="top" rowspan="1" colspan="1"><named-content content-type="sequence">GTAACCCATCATGGGCCCTAC</named-content></td></tr><tr><td valign="top" rowspan="1" colspan="1">Sequenced-based reagent</td><td valign="top" rowspan="1" colspan="1">Chr13.1-cl KO gRNA.2</td><td valign="top" rowspan="1" colspan="1">This paper</td><td valign="top" rowspan="1" colspan="1">Cas9 gRNA</td><td valign="top" rowspan="1" colspan="1"><named-content content-type="sequence">GGACAGGTTATAGGTTTGAT</named-content></td></tr><tr><td valign="top" rowspan="1" colspan="1">Sequenced-based reagent</td><td valign="top" rowspan="1" colspan="1">Chr13.2-cl KO gRNA.1</td><td valign="top" rowspan="1" colspan="1">This paper</td><td valign="top" rowspan="1" colspan="1">Cas9 gRNA</td><td valign="top" rowspan="1" colspan="1"><named-content content-type="sequence">GGGTTTCTGAGAAACGTGTA</named-content></td></tr><tr><td valign="top" rowspan="1" colspan="1">Sequenced-based reagent</td><td valign="top" rowspan="1" colspan="1">Chr13.2-cl KO gRNA.2</td><td valign="top" rowspan="1" colspan="1">This paper</td><td valign="top" rowspan="1" colspan="1">Cas9 gRNA</td><td valign="top" rowspan="1" colspan="1"><named-content content-type="sequence">GTGTAATGAGTTCTTATATC</named-content></td></tr><tr><td valign="top" rowspan="1" colspan="1">Commercial assay or kit</td><td valign="top" rowspan="1" colspan="1">SureSelectQXT Target Enrichment kit</td><td valign="top" rowspan="1" colspan="1">Agilent</td><td valign="top" rowspan="1" colspan="1">G9681-90000</td><td valign="top" rowspan="1" colspan="1"/></tr><tr><td valign="top" rowspan="1" colspan="1">Software, algorithm</td><td valign="top" rowspan="1" colspan="1">Bowtie</td><td valign="top" rowspan="1" colspan="1"><ext-link ext-link-type="uri" xlink:href="http://bowtie-bio.sourceforge.net">http://bowtie-bio.sourceforge.net</ext-link></td><td valign="top" rowspan="1" colspan="1">RRID:<ext-link ext-link-type="uri" xlink:href="https://scicrunch.org/resolver/SCR_005476">SCR_005476</ext-link></td><td valign="top" rowspan="1" colspan="1"/></tr><tr><td valign="top" rowspan="1" colspan="1">Software, algorithm</td><td valign="top" rowspan="1" colspan="1">MACS14</td><td valign="top" rowspan="1" colspan="1"><ext-link ext-link-type="uri" xlink:href="https://bio.tools/macs">https://bio.tools/macs</ext-link></td><td valign="top" rowspan="1" colspan="1">RRID:<ext-link ext-link-type="uri" xlink:href="https://scicrunch.org/resolver/SCR_013291">SCR_013291</ext-link></td><td valign="top" rowspan="1" colspan="1"/></tr><tr><td valign="top" rowspan="1" colspan="1">Software, algorithm</td><td valign="top" rowspan="1" colspan="1">Tophat</td><td valign="top" rowspan="1" colspan="1"><ext-link ext-link-type="uri" xlink:href="https://ccb.jhu.edu">https://ccb.jhu.edu</ext-link></td><td valign="top" rowspan="1" colspan="1">RRID:<ext-link ext-link-type="uri" xlink:href="https://scicrunch.org/resolver/SCR_013035">SCR_013035</ext-link></td><td valign="top" rowspan="1" colspan="1"/></tr></tbody></table></table-wrap><sec id="s4-1"><title>Cell lines and transgenic mice</title><p>Mouse ES cells and F9 EC cells were cultivated as described previously (<xref rid="bib54" ref-type="bibr">Wolf et al., 2015b</xref>) unless stated otherwise. Chr4-cl KO ES cells originate from B6;129&#x02010; Gt(ROSA)26Sortm1(cre/ERT)Nat/J mice (Jackson lab), all other KRAB-ZFP cluster KO ES cell lines originate from JM8A3.N1 C57BL/6N-A<sup>tm1Brd</sup> ES cells (KOMP Repository). Chr2-cl KO and WT ES cells were initially grown in serum-containing media (<xref rid="bib54" ref-type="bibr">Wolf et al., 2015b</xref>) but changed to 2i media (<xref rid="bib8" ref-type="bibr">De Iaco et al., 2017</xref>) for several weeks before analysis. To generate Chr4-cl and Chr2-cl KO mice, the cluster deletions were repeated in B6 ES (KOMP repository) or R1 (Nagy lab) ES cells, respectively, and heterozygous clones were injected into B6 albino blastocysts. Chr2-cl KO mice were therefore kept on a mixed B6/Svx129/Sv-CP strain background while Chr4-cl KO mice were initially derived on a pure C57BL/6 background. For capture-seq screens, Chr4-cl KO mice were crossed with 129 &#x000d7; 1/SvJ mice (Jackson lab) to produce the founder mice for Chr4-cl KO and WT (B6/129 F1) offspring. Chr4-cl KO/WT (B6/129 F1) were also crossed with 129 &#x000d7; 1/SvJ mice to get Chr4-cl KO/WT (B6/129 F1) mice, which were intercrossed to give rise to the parents of Chr4-cl KO/KO and KO/WT (B6/129 F2) offspring.</p></sec><sec id="s4-2"><title>Generation of KRAB-ZFP expressing cell lines</title><p>KRAB-ZFP ORFs were PCR-amplified from cDNA or synthesized with codon-optimization (<xref ref-type="supplementary-material" rid="supp1">Supplementary file 1</xref>), and stably expressed with 3XFLAG or 3XHA tags in F9 EC or ES cells using <italic>Sleeping beauty</italic> transposon-based (<xref rid="bib54" ref-type="bibr">Wolf et al., 2015b</xref>) or lentiviral expression vectors (<xref rid="bib17" ref-type="bibr">Imbeault et al., 2017</xref>; <xref ref-type="supplementary-material" rid="supp1">Supplementary file 1</xref>). Cells were selected with puromycin (1 &#x000b5;g/ml) and resistant clones were pooled and further expanded for ChIP-seq.</p></sec><sec id="s4-3"><title>CRISPR/Cas9 mediated deletion of KRAB-ZFP clusters and an MMETn insertion</title><p>All gRNAs were expressed from the pX330-U6-Chimeric_BB-CBh-hSpCas9 vector (RRID:<ext-link ext-link-type="uri" xlink:href="https://scicrunch.org/resolver/Addgene_42230">Addgene_42230</ext-link>) and nucleofected into 10<sup>6</sup> ES cells using Amaxa nucleofection in the following amounts: 5 &#x000b5;g of each pX330-gRNA plasmid, 1 &#x000b5;g pPGK-puro and 500 pmoles single-stranded repair oligos (<xref ref-type="supplementary-material" rid="supp3">Supplementary file 3</xref>). One day after nucleofection, cells were kept under puromycin selection (1 &#x000b5;g/ml) for 24 hr. Individual KO and WT clones were picked 7&#x02013;8 days after nucleofection and expanded for PCR genotyping (<xref ref-type="supplementary-material" rid="supp3">Supplementary file 3</xref>).</p></sec><sec id="s4-4"><title>ChIP-seq analysis</title><p>For ChIP-seq analysis of KRAB-ZFP expressing cells, 5&#x02013;10 &#x000d7; 10<sup>7</sup> cells were crosslinked and immunoprecipitated with anti-FLAG (Sigma-Aldrich Cat# F1804, RRID:<ext-link ext-link-type="uri" xlink:href="https://scicrunch.org/resolver/AB_262044">AB_262044</ext-link>) or anti-HA (Abcam Cat# ab9110, RRID:<ext-link ext-link-type="uri" xlink:href="https://scicrunch.org/resolver/AB_307019">AB_307019</ext-link> or Covance Cat# MMS-101P-200, RRID:<ext-link ext-link-type="uri" xlink:href="https://scicrunch.org/resolver/AB_10064068">AB_10064068</ext-link>) antibody using one of two previously described protocols (<xref rid="bib37" ref-type="bibr">O'Geen et al., 2010</xref>; <xref rid="bib17" ref-type="bibr">Imbeault et al., 2017</xref>) as indicated in <xref ref-type="supplementary-material" rid="supp1">Supplementary file 1</xref>. H3K9me3 distribution in Chr4-cl, Chr10-cl, Chr13.1-cl and Chr13.2-cl KO ES cells was determined by native ChIP-seq with anti-H3K9me3 serum (Active Motif Cat# 39161, RRID:<ext-link ext-link-type="uri" xlink:href="https://scicrunch.org/resolver/AB_2532132">AB_2532132</ext-link>) as described previously (<xref rid="bib20" ref-type="bibr">Karimi et al., 2011</xref>). In Chr2-cl KO ES cells, H3K9me3 and KAP1 ChIP-seq was performed as previously described (<xref rid="bib11" ref-type="bibr">Ecco et al., 2016</xref>). In Chr4-cl KO and WT ES cells KAP1 binding was determined by endogenous tagging of KAP1 with C-terminal GFP (<xref ref-type="supplementary-material" rid="supp3">Supplementary file 3</xref>), followed by FACS to enrich for GFP-positive cells and ChIP with anti-GFP (Thermo Fisher Scientific Cat# A-11122, RRID:<ext-link ext-link-type="uri" xlink:href="https://scicrunch.org/resolver/AB_221569">AB_221569</ext-link>) using a previously described protocol (<xref rid="bib37" ref-type="bibr">O'Geen et al., 2010</xref>). For ChIP-seq analysis of active histone marks, cross-linked chromatin from ES cells or testis (from two-week old mice) was immunoprecipitated with antibodies against H3K4me3 (Abcam Cat# ab8580, RRID:<ext-link ext-link-type="uri" xlink:href="https://scicrunch.org/resolver/AB_306649">AB_306649</ext-link>), H3K4me1 (Abcam Cat# ab8895, RRID:<ext-link ext-link-type="uri" xlink:href="https://scicrunch.org/resolver/AB_306847">AB_306847</ext-link>) and H3K27ac (Abcam Cat# ab4729, RRID:<ext-link ext-link-type="uri" xlink:href="https://scicrunch.org/resolver/AB_2118291">AB_2118291</ext-link>) following the protocol developed by <xref rid="bib37" ref-type="bibr">O'Geen et al., 2010</xref> or <xref rid="bib22" ref-type="bibr">Khil et al., 2012</xref> respectively.</p><p>ChIP-seq libraries were constructed and sequenced as indicated in <xref ref-type="supplementary-material" rid="supp4">Supplementary file 4</xref>. Reads were mapped to the mm9 genome using Bowtie (RRID:<ext-link ext-link-type="uri" xlink:href="https://scicrunch.org/resolver/SCR_005476">SCR_005476</ext-link>; settings: --<monospace>best</monospace>) or Bowtie2 (<xref rid="bib24" ref-type="bibr">Langmead and Salzberg, 2012</xref>) as indicated in <xref ref-type="supplementary-material" rid="supp4">Supplementary file 4</xref>. Under these settings, reads that map to multiple genomic regions are assigned to the top-scored match and, if a set of equally good choices is encountered, a pseudo-random number is used to choose one location. Peaks were called using MACS14 (RRID:<ext-link ext-link-type="uri" xlink:href="https://scicrunch.org/resolver/SCR_013291">SCR_013291</ext-link>) under high stringency settings (p&#x0003c;1e-10, peak enrichment&#x000a0;&#x0003e;20) (<xref rid="bib56" ref-type="bibr">Zhang et al., 2008</xref>). Peaks were called both over the Input control and a FLAG or HA control ChIP (unless otherwise stated in <xref ref-type="supplementary-material" rid="supp4">Supplementary file 4</xref>) and only peaks that were called in both settings were kept for further analysis. In cases when the stringency settings did not result in at least 50 peaks, the settings were changed to medium (p&#x0003c;1e-10, peak enrichment&#x000a0;&#x0003e;10) or low (p&#x0003c;1e-5, peak enrichment&#x000a0;&#x0003e;10) stringency (<xref ref-type="supplementary-material" rid="supp4">Supplementary file 4</xref>). For further analysis, all peaks were scaled to 200 bp regions centered around the peak summits. The overlap of the scaled peaks to each repeat element in UCSC Genome Browser (RRID:<ext-link ext-link-type="uri" xlink:href="https://scicrunch.org/resolver/SCR_005780">SCR_005780</ext-link>) were calculated by using the bedfisher function (settings: -f 0.25) from BEDTools (RRID:<ext-link ext-link-type="uri" xlink:href="https://scicrunch.org/resolver/SCR_006646">SCR_006646</ext-link>). The right-tailed p-values between pair-wise comparison of each ChIP-seq peak and repeat element were extracted, and then adjusted using the Benjamini-Hochberg approach implemented in the R function p.adjust(). Binding motifs were determined using only nonrepetitive (&#x0003c;10% repeat content) peaks with MEME (<xref rid="bib1" ref-type="bibr">Bailey et al., 2009</xref>). MEME motifs were compared with in silico predicted motifs (<xref rid="bib35" ref-type="bibr">Najafabadi et al., 2015</xref>) using Tomtom (<xref rid="bib1" ref-type="bibr">Bailey et al., 2009</xref>) and considered as significantly overlapping with a False Discovery Rate (FDR) below 0.1. To find MEME and predicted motifs in repetitive peaks, we used FIMO (<xref rid="bib1" ref-type="bibr">Bailey et al., 2009</xref>). Differential H3K9me3 and KAP1 distribution in WT and Chr2-cl or Chr4-cl KO ES cells at TEs was determined by counting ChIP-seq reads overlapping annotated insertions of each TE group using BEDTools (MultiCovBed). Additionally, ChIP-seq reads were counted at the TE fraction that was bound by Chr2-cl or Chr4-cl KRAB-ZFPs (overlapping with 200 bp peaks). Count tables were concatenated and analyzed using DESeq2 (<xref rid="bib30" ref-type="bibr">Love et al., 2014</xref>). The previously published ChIP-seq datasets for KAP1 (<xref rid="bib5" ref-type="bibr">Castro-Diaz et al., 2014</xref>) and H3K9me3 (<xref rid="bib7" ref-type="bibr">Dan et al., 2014</xref>) were re-mapped using Bowtie (--<monospace>best</monospace>).</p></sec><sec id="s4-5"><title>Luciferase reporter assays</title><p>For KRAB-ZFP repression assays, double-stranded DNA oligos containing KRAB-ZFP target sequences (<xref ref-type="supplementary-material" rid="supp3">Supplementary file 3</xref>) were cloned upstream of the SV40 promoter of the pGL3-Promoter vector (Promega) between the restriction sites for NheI and XhoI. 33 ng of reporter vectors were co-transfected (Lipofectamine 2000, Thermofisher) with 33 ng pRL-SV40 (Promega) for normalization and 33 ng of transient KRAB-ZFP expression vectors (in pcDNA3.1) or empty pcDNA3.1 into 293 T cells seeded one day earlier in 96-well plates. Cells were lysed 48 hr after transfection and luciferase/Renilla luciferase activity was measured using the Dual-Luciferase Reporter Assay System (Promega). To measure the transcriptional activity of the MMETn element upstream of the <italic>Cd59a</italic> gene, fragments of varying sizes (<xref ref-type="supplementary-material" rid="supp3">Supplementary file 3</xref>) were cloned into the promoter-less pGL3-basic vector (Promega) using NheI and NcoI sites. 70 ng of reporter vectors were cotransfected with 30 ng pRL-SV40 into feeder-depleted Chr4-cl WT and KO ES cells, seeded into a gelatinized 96-well plate 2 hr before transfection. Luciferase activity was measured 48 hr after transfection as described above.</p></sec><sec id="s4-6"><title>RNA-seq analysis</title><p>Whole RNA was purified using RNeasy columns (Qiagen) with on column DNase treatment or the High Pure RNA Isolation Kit (Roche) (<xref ref-type="supplementary-material" rid="supp4">Supplementary file 4</xref>). Tissues were first lysed in TRIzol reagent (ThermoFisher) and RNA was purified after the isopropanol precipitation step using RNeasy columns (Qiagen) with on column DNase treatment. Libraries were generated using the SureSelect Strand-Specific RNA Library Prep kit (Agilent) or Illumina&#x02019;s TruSeq RNA Library Prep Kit (with polyA selection) and sequenced as 50 or 100 bp paired-end reads on an Illumina HiSeq2500 (RRID:<ext-link ext-link-type="uri" xlink:href="https://scicrunch.org/resolver/SCR_016383">SCR_016383</ext-link>) or HiSeq3000 (RRID:<ext-link ext-link-type="uri" xlink:href="https://scicrunch.org/resolver/SCR_016386">SCR_016386</ext-link>) machine (<xref ref-type="supplementary-material" rid="supp4">Supplementary file 4</xref>). RNA-seq reads were mapped to the mouse genome (mm9) using Tophat (RRID:<ext-link ext-link-type="uri" xlink:href="https://scicrunch.org/resolver/SCR_013035">SCR_013035</ext-link>; settings: --I 200000 g 1) unless otherwise stated. These settings allow each mappable read to be reported once, in case the read maps to multiple locations equally well, one match is randomly chosen. For differential transposon expression, mapped reads that overlap with TEs annotated in Repeatmasker (RRID:<ext-link ext-link-type="uri" xlink:href="https://scicrunch.org/resolver/SCR_012954">SCR_012954</ext-link>) were counted using BEDTools MultiCovBed (setting: -split). Reads mapping to multiple fragments that belong to the same TE insertion (as indicated by the repeat ID) were summed up. Only transposons with a total of at least 20 (for two biological replicates) or 30 (for three biological replicates) mapped reads across WT and KO samples were considered for differential expression analysis. Transposons within the deleted KRAB-ZFP cluster were excluded from the analysis. Read count tables were used for differential expression analysis with DESeq2 (RRID:<ext-link ext-link-type="uri" xlink:href="https://scicrunch.org/resolver/SCR_015687">SCR_015687</ext-link>). For differential gene expression analysis, reads overlapping with gene exons were counted using HTSeq-count and analyzed using DESeq2. To test if KRAB-ZFP peaks are significantly enriched near up- or down-regulated genes, a binomial test was performed. Briefly, the proportion of the peaks that are located within a certain distance up- or downstream to the TSS of genes was determined using the windowBed function of BED tools. The probability <italic>p</italic> in the binomial distribution was estimated as the fraction of all genes overlapped with KRAB-ZFP peaks. Then, given <italic>n</italic> which is the number of specific groups of genes, and <italic>x</italic> which is the number of this group of genes overlapped with peaks, the R function binom.test() was used to estimate the p-value based on right-tailed Binomial test. Finally, the adjusted p-values were determined separately for LTR and LINE retrotransposon groups using the Benjamini-Hochberg approach implemented in the R function p.adjust().</p></sec><sec id="s4-7"><title>Reduced representation bisulfite sequencing (RRBS-seq)</title><p>For RRBS-seq analysis, Chr4-cl WT and KO ES cells were grown in either standard ES cell media containing FCS or for one week in 2i media containing vitamin C as described previously (<xref rid="bib3" ref-type="bibr">Blaschke et al., 2013</xref>). Genomic DNA was purified from WT and Chr4-cl KO ES cells using the Quick-gDNA purification kit (Zymo Research) and bisulfite-converted with the NEXTflex Bisulfite-Seq Kit (Bio Scientific) using Msp1 digestion to fragment DNA. Libraries were sequenced as 50 bp paired-end reads on an Illumina HiSeq. The reads were processed using Trim Galore (--<monospace>illumina</monospace> --<monospace>paired</monospace> &#x02013;<monospace>rrbs</monospace>) to trim poor quality bases and adaptors. Additionally, the first 5 nt of R2 and the last 3 nt of R1 and R2 were trimmed. Reads were then mapped to the reference genome (mm9) using Bismark (<xref rid="bib23" ref-type="bibr">Krueger and Andrews, 2011</xref>) to extract methylation calling results. The CpG methylation pattern for each covered CpG dyads (two complementary CG dinucleotides) was calculated using a custom script (<xref ref-type="supplementary-material" rid="scode1">Source code 1</xref>: get_CpG_ML.pl). For comparison of CpG methylation between WT and Chr4-cl KO ES cells (in serum or 2i + Vitamin C conditions) only CpG sites with at least 10-fold coverage in each sample were considered for analysis.</p></sec><sec id="s4-8"><title>Retrotransposition assay</title><p>The retrotransposition vectors pCMV-MusD2, pCMV-MusD2-neoTNF and pCMV-ETnI1-neoTNF (<xref rid="bib39" ref-type="bibr">Ribet et al., 2004</xref>) were a kind gift from Dixie Mager. To partially delete the Gm13051 binding site within pCMV-MusD2-neoTNF, the vector was cut with KpnI and re-ligated using a repair oligo, leaving a 24 bp deletion within the Gm13051 binding site. The Rex2 binding site in pCMV-ETnI1-neoTNF was deleted by cutting the vector with EcoRI and XbaI followed by re-ligation using two overlapping PCR products, leaving a 45 bp deletion while maintaining the rest of the vector unchanged (see <xref ref-type="supplementary-material" rid="supp3">Supplementary file 3</xref> for primer sequences). For MusD retrotransposition assays, 5 &#x000d7; 10<sup>4</sup> HeLa cells (ATCC CCL-2) were transfected in a 24-well dish with 100 ng pCMV-MusD2-neoTNF or pCMV-MusD2-neoTNF (&#x00394;Gm13051-m) using Lipofectamine 2000. For ETn retrotransposition assays, 50 ng of pCMV-ETnI1-neoTNF or pCMV-ETnI1-neoTNF (&#x00394;Rex2) vectors were cotransfected with 50 ng pCMV-MusD2 to provide gag and pol proteins in trans. G418 (0.6 mg/ml) was added five days after transfection and cells were grown under selection until colonies were readily visible by eye. G418-resistant colonies were stained with Amido Black (Sigma).</p></sec><sec id="s4-9"><title>Capture-seq screen</title><p>To identify novel retrotransposon insertions, genomic DNA from various tissues (<xref ref-type="supplementary-material" rid="supp4">Supplementary file 4</xref>) was purified and used for library construction with target enrichment using the SureSelect<sup>QXT</sup> Target Enrichment kit (Agilent). Custom RNA capture probes were designed to hybridize with the 120 bp 5&#x02019; ends of the 5&#x02019; LTRs and the 120 bp 3&#x02019; ends of the 3&#x02019; LTR of about 600 intact (internal region flanked by two LTRs) MMETn/RLTRETN retrotransposons or of 140 RLTR4_MM/RLTR4 retrotransposons that were upregulated in Chr4-cl KO ES cells (<xref ref-type="supplementary-material" rid="fig4sdata2">Figure 4&#x02014;source data 2</xref>). Enriched libraries were sequenced on an Illumina HiSeq as paired-end 50 bp reads. R1 and R2 reads were mapped to the mm9 genome separately, using settings that only allow non-duplicated, uniquely mappable reads (Bowtie -m 1 --<monospace>best</monospace> --<monospace>strata</monospace>; samtools rmdup -s) and under settings that allow multimapping and duplicated reads (Bowtie --<monospace>best</monospace>). Of the latter, only reads that overlap (min. 50% of read) with RLTRETN, MMETn-int, ETnERV-int, ETnERV2-int or ETnERV3-int repeats (ETn) or RLTR4, RLTR4_MM-int or MuLV-int repeats (RLTR4) were kept. Only uniquely mappable reads whose paired reads were overlapping with the repeats mentioned above were used for further analysis. All ETn- and RLTR4-paired reads were then clustered (as bed files) using BEDTools (bedtools merge -i -n -d 1000) to receive a list of all potential annotated and non-annotated new ETn or RLTR4 insertion sites and all overlapping ETn- or RLTR4-paired reads were counted for each sample at each locus. Finally, all regions that were located within 1 kb of an annotated RLTRETN, MMETn-int, ETnERV-int, ETnERV2-int or ETnERV3-int repeat as well as regions overlapping with previously identified polymorphic ETn elements (<xref rid="bib36" ref-type="bibr">Nell&#x000e5;ker et al., 2012</xref>) were removed. Genomic loci with at least 10 reads per million unique ETn- or RLTR4-paired reads were considered as insertion sites. To qualify for a de-novo insertion, we allowed no called insertions in any of the other screened mice at the locus and not a single read at the locus in the ancestors of the mouse. Insertions at the same locus in at least two siblings from the same offspring were considered as germ line insertions, if the insertion was absent in the parents and mice who were not direct descendants from these siblings. Full-length sequencing of new ETn insertions was done by Sanger sequencing of short PCR products in combination with Illumina sequencing of a large PCR product (<xref ref-type="supplementary-material" rid="supp3">Supplementary file 3</xref>), followed by de-novo assembly using the Unicycler software.</p></sec></sec></body><back><sec sec-type="funding-information"><title>Funding Information</title><p>This paper was supported by the following grants:</p><list list-type="bullet"><list-item><p><funding-source><institution-wrap><institution-id institution-id-type="FundRef">http://dx.doi.org/10.13039/100009633</institution-id><institution>Eunice Kennedy Shriver National Institute of Child Health and Human Development</institution></institution-wrap></funding-source>
<award-id>1ZIAHD008933</award-id> to Todd S Macfarlan.</p></list-item><list-item><p><funding-source><institution-wrap><institution-id institution-id-type="FundRef">http://dx.doi.org/10.13039/501100001711</institution-id><institution>Swiss National Science Foundation</institution></institution-wrap></funding-source>
<award-id>310030_152879</award-id> to Didier Trono.</p></list-item><list-item><p><funding-source><institution-wrap><institution-id institution-id-type="FundRef">http://dx.doi.org/10.13039/501100001711</institution-id><institution>Swiss National Science Foundation</institution></institution-wrap></funding-source>
<award-id>310030B_173337</award-id> to Didier Trono.</p></list-item><list-item><p><funding-source><institution-wrap><institution-id institution-id-type="FundRef">http://dx.doi.org/10.13039/501100000781</institution-id><institution>European Research Council</institution></institution-wrap></funding-source>
<award-id>No. 268721</award-id> to Didier Trono.</p></list-item><list-item><p><funding-source><institution-wrap><institution-id institution-id-type="FundRef">http://dx.doi.org/10.13039/501100000781</institution-id><institution>European Research Council</institution></institution-wrap></funding-source>
<award-id>No 694658</award-id> to Didier Trono.</p></list-item></list></sec><ack id="ack"><title>Acknowledgements</title><p>We thank Alex Grinberg, Jeanne Yimdjo and Victoria Carter for generating and maintaining transgenic mice. We also thank members of the Macfarlan and Trono labs for useful discussion, Steven Coon, James Iben, Tianwei Li and Anna Malawska for NGS and computational support. This work was supported by NIH grant 1ZIAHD008933 and the NIH DDIR Innovation Award program (TSM), and by subsidies from the Swiss National Science Foundation (310030_152879 and 310030B_173337) and the European Research Council (KRABnKAP, No. 268721; Transpos-X, No. 694658) (DT).</p></ack><sec id="s5" sec-type="additional-information"><title>Additional information</title><fn-group content-type="competing-interest"><title><bold>Competing interests</bold></title><fn fn-type="COI-statement" id="conf1"><p>No competing interests declared.</p></fn></fn-group><fn-group content-type="author-contribution"><title><bold>Author contributions</bold></title><fn fn-type="con" id="con1"><p>Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Writing - original draft.</p></fn><fn fn-type="con" id="con2"><p>Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Writing - original draft, Writing - review and editing.</p></fn><fn fn-type="con" id="con3"><p>Conceptualization, Data curation, Software, Formal analysis, Investigation, Visualization, Methodology, Writing - review and editing.</p></fn><fn fn-type="con" id="con4"><p>Conceptualization, Formal analysis, Investigation, Writing - review and editing.</p></fn><fn fn-type="con" id="con5"><p>Investigation.</p></fn><fn fn-type="con" id="con6"><p>Investigation.</p></fn><fn fn-type="con" id="con7"><p>Data curation, Software, Formal analysis, Visualization.</p></fn><fn fn-type="con" id="con8"><p>Investigation.</p></fn><fn fn-type="con" id="con9"><p>Conceptualization, Resources, Supervision, Funding acquisition, Investigation, Writing - review and editing.</p></fn><fn fn-type="con" id="con10"><p>Conceptualization, Resources, Supervision, Funding acquisition, Investigation, Methodology, Writing - original draft, Project administration, Writing - review and editing.</p></fn></fn-group><fn-group content-type="ethics-information"><title><bold>Ethics</bold></title><fn fn-type="other"><p>Animal experimentation: All studies using mice were performed in accordance to the Guide for the Care and Use of Laboratory Animals of the NIH, under IACUC animal protocol (ASP )18-026.</p></fn></fn-group></sec><sec id="s6" sec-type="supplementary-material"><title>Additional files</title><supplementary-material content-type="local-data" id="scode1"><label>Source code 1.</label><caption><title>Custom Perl script used to get methylation pattern for each CpG dyads from Bismark methylation calling results.</title></caption><media mime-subtype="x-script.perl" mimetype="text" xlink:href="elife-56337-code1.pl" orientation="portrait" id="d36e1978" position="anchor"/></supplementary-material><supplementary-material content-type="local-data" id="supp1"><label>Supplementary file 1.</label><caption><title>Experimental parameters, gene-centered informa347tion and summary of KRAB-ZFP ChIP-seq analysis.</title></caption><media mime-subtype="xlsx" mimetype="application" xlink:href="elife-56337-supp1.xlsx" orientation="portrait" id="d36e1985" position="anchor"/></supplementary-material><supplementary-material content-type="local-data" id="supp2"><label>Supplementary file 2.</label><caption><title>Differential CpG methylation status of TEs in WT and Chr4-cl KO ES cells in serum and 2i culture conditions.</title></caption><media mime-subtype="xlsx" mimetype="application" xlink:href="elife-56337-supp2.xlsx" orientation="portrait" id="d36e1992" position="anchor"/></supplementary-material><supplementary-material content-type="local-data" id="supp3"><label>Supplementary file 3.</label><caption><title>Sequence information of used PCR primers, gRNAs and cloned oligos for luciferase repression assays.</title></caption><media mime-subtype="xlsx" mimetype="application" xlink:href="elife-56337-supp3.xlsx" orientation="portrait" id="d36e1999" position="anchor"/></supplementary-material><supplementary-material content-type="local-data" id="supp4"><label>Supplementary file 4.</label><caption><title>Overview of generated NGS data.</title></caption><media mime-subtype="xlsx" mimetype="application" xlink:href="elife-56337-supp4.xlsx" orientation="portrait" id="d36e2006" position="anchor"/></supplementary-material><supplementary-material content-type="local-data" id="transrepform"><label>Transparent reporting form</label><media mime-subtype="docx" mimetype="application" xlink:href="elife-56337-transrepform.docx" orientation="portrait" id="d36e2010" position="anchor"/></supplementary-material></sec><sec id="s7" sec-type="data-availability"><title>Data availability</title><p>All NGS data has been deposited in GEO (GSE115291). Sequences of full-length de novo ETn insertions have been deposited in the GenBank database (MH449667- MH449669).</p><p>The following datasets were generated:</p><p><element-citation publication-type="data" id="dataset1"><person-group person-group-type="author"><name><surname>Wolf</surname><given-names>G</given-names></name></person-group><year iso-8601-date="2019">2019</year><data-title>Retrotransposon reactivation and mobilization upon deletions of megabase scale KRAB zinc finger gene clusters in mice</data-title><source>NCBI Gene Expression Omnibus</source><pub-id assigning-authority="NCBI" pub-id-type="accession" xlink:href="https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE115291">GSE115291</pub-id></element-citation></p><p><element-citation publication-type="data" id="dataset4"><person-group person-group-type="author"><name><surname>Wolf</surname><given-names>G</given-names></name></person-group><year iso-8601-date="2019">2019</year><data-title>Mus musculus musculus strain C57BL/6x129X1/SvJ retrotransposon MMETn-int, complete sequence</data-title><source>NCBI GenBank</source><pub-id assigning-authority="NCBI" pub-id-type="accession" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/MH449667">MH449667</pub-id></element-citation></p><p><element-citation publication-type="data" id="dataset5"><person-group person-group-type="author"><name><surname>Wolf</surname><given-names>G</given-names></name></person-group><year iso-8601-date="2019">2019</year><data-title>Mus musculus musculus strain C57BL/6x129X1/SvJ retrotransposon MMETn-int, complete sequence</data-title><source>NCBI GenBank</source><pub-id assigning-authority="NCBI" pub-id-type="accession" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/MH449668">MH449668</pub-id></element-citation></p><p><element-citation publication-type="data" id="dataset6"><person-group person-group-type="author"><name><surname>Wolf</surname><given-names>G</given-names></name></person-group><year iso-8601-date="2019">2019</year><data-title>Mus musculus musculus strain C57BL/6x129X1/SvJ retrotransposon MMETn-int, complete sequence</data-title><source>NCBI GenBank</source><pub-id assigning-authority="NCBI" pub-id-type="accession" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/MH449669">MH449669</pub-id></element-citation></p><p>The following previously published datasets were used:</p><p><element-citation publication-type="data" id="dataset2"><person-group person-group-type="author"><name><surname>Castro-Diaz</surname><given-names>N</given-names></name><name><surname>Ecco</surname><given-names>G</given-names></name><name><surname>Coluccio</surname><given-names>A</given-names></name><name><surname>Kapopoulou</surname><given-names>A</given-names></name><name><surname>Duc</surname><given-names>J</given-names></name><name><surname>Trono</surname><given-names>D</given-names></name></person-group><year iso-8601-date="2014">2014</year><data-title>Evollutionally dynamic L1 regulation in embryonic stem cells</data-title><source>NCBI Gene Expression Omnibus</source><pub-id assigning-authority="NCBI" pub-id-type="accession" xlink:href="https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE58323">GSM1406445</pub-id></element-citation></p><p><element-citation publication-type="data" id="dataset3"><person-group person-group-type="author"><name><surname>Andrew</surname><given-names>ZX</given-names></name></person-group><year iso-8601-date="2014">2014</year><data-title>H3K9me3_ChIPSeq (Ctrl)</data-title><source>NCBI Gene Expression Omnibus</source><pub-id assigning-authority="NCBI" pub-id-type="accession" xlink:href="https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSM1327148">GSM1327148</pub-id></element-citation></p></sec><ref-list><title>References</title><ref id="bib1"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Bailey</surname><given-names>TL</given-names></name><name><surname>Boden</surname><given-names>M</given-names></name><name><surname>Buske</surname><given-names>FA</given-names></name><name><surname>Frith</surname><given-names>M</given-names></name><name><surname>Grant</surname><given-names>CE</given-names></name><name><surname>Clementi</surname><given-names>L</given-names></name><name><surname>Ren</surname><given-names>J</given-names></name><name><surname>Li</surname><given-names>WW</given-names></name><name><surname>Noble</surname><given-names>WS</given-names></name></person-group><year>2009</year><article-title>MEME SUITE: tools for motif discovery and searching</article-title><source>Nucleic Acids Research</source><volume>37</volume><fpage>W202</fpage><lpage>W208</lpage><pub-id pub-id-type="doi">10.1093/nar/gkp335</pub-id><pub-id pub-id-type="pmid">19458158</pub-id></element-citation></ref><ref id="bib2"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Baust</surname><given-names>C</given-names></name><name><surname>Gagnier</surname><given-names>L</given-names></name><name><surname>Baillie</surname><given-names>GJ</given-names></name><name><surname>Harris</surname><given-names>MJ</given-names></name><name><surname>Juriloff</surname><given-names>DM</given-names></name><name><surname>Mager</surname><given-names>DL</given-names></name></person-group><year>2003</year><article-title>Structure and expression of mobile ETnII retroelements and their coding-competent MusD relatives in the mouse</article-title><source>Journal of Virology</source><volume>77</volume><fpage>11448</fpage><lpage>11458</lpage><pub-id pub-id-type="doi">10.1128/JVI.77.21.11448-11458.2003</pub-id><pub-id pub-id-type="pmid">14557630</pub-id></element-citation></ref><ref id="bib3"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Blaschke</surname><given-names>K</given-names></name><name><surname>Ebata</surname><given-names>KT</given-names></name><name><surname>Karimi</surname><given-names>MM</given-names></name><name><surname>Zepeda-Mart&#x000ed;nez</surname><given-names>JA</given-names></name><name><surname>Goyal</surname><given-names>P</given-names></name><name><surname>Mahapatra</surname><given-names>S</given-names></name><name><surname>Tam</surname><given-names>A</given-names></name><name><surname>Laird</surname><given-names>DJ</given-names></name><name><surname>Hirst</surname><given-names>M</given-names></name><name><surname>Rao</surname><given-names>A</given-names></name><name><surname>Lorincz</surname><given-names>MC</given-names></name><name><surname>Ramalho-Santos</surname><given-names>M</given-names></name></person-group><year>2013</year><article-title>Vitamin C induces Tet-dependent DNA demethylation and a blastocyst-like state in ES cells</article-title><source>Nature</source><volume>500</volume><fpage>222</fpage><lpage>226</lpage><pub-id pub-id-type="doi">10.1038/nature12362</pub-id><pub-id pub-id-type="pmid">23812591</pub-id></element-citation></ref><ref id="bib4"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Brodziak</surname><given-names>A</given-names></name><name><surname>Zi&#x000f3;&#x00142;ko</surname><given-names>E</given-names></name><name><surname>Muc-Wierzgo&#x00144;</surname><given-names>M</given-names></name><name><surname>Nowakowska-Zajdel</surname><given-names>E</given-names></name><name><surname>Kokot</surname><given-names>T</given-names></name><name><surname>Klakla</surname><given-names>K</given-names></name></person-group><year>2012</year><article-title>The role of human endogenous retroviruses in the pathogenesis of autoimmune diseases</article-title><source>Medical Science Monitor : International Medical Journal of Experimental and Clinical Research</source><volume>18</volume><fpage>RA80</fpage><lpage>RA88</lpage><pub-id pub-id-type="doi">10.12659/msm.882892</pub-id><pub-id pub-id-type="pmid">22648263</pub-id></element-citation></ref><ref id="bib5"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Castro-Diaz</surname><given-names>N</given-names></name><name><surname>Ecco</surname><given-names>G</given-names></name><name><surname>Coluccio</surname><given-names>A</given-names></name><name><surname>Kapopoulou</surname><given-names>A</given-names></name><name><surname>Yazdanpanah</surname><given-names>B</given-names></name><name><surname>Friedli</surname><given-names>M</given-names></name><name><surname>Duc</surname><given-names>J</given-names></name><name><surname>Jang</surname><given-names>SM</given-names></name><name><surname>Turelli</surname><given-names>P</given-names></name><name><surname>Trono</surname><given-names>D</given-names></name></person-group><year>2014</year><article-title>Evolutionally dynamic L1 regulation in embryonic stem cells</article-title><source>Genes &#x00026; Development</source><volume>28</volume><fpage>1397</fpage><lpage>1409</lpage><pub-id pub-id-type="doi">10.1101/gad.241661.114</pub-id><pub-id pub-id-type="pmid">24939876</pub-id></element-citation></ref><ref id="bib6"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Chuong</surname><given-names>EB</given-names></name><name><surname>Elde</surname><given-names>NC</given-names></name><name><surname>Feschotte</surname><given-names>C</given-names></name></person-group><year>2016</year><article-title>Regulatory evolution of innate immunity through co-option of endogenous retroviruses</article-title><source>Science</source><volume>351</volume><fpage>1083</fpage><lpage>1087</lpage><pub-id pub-id-type="doi">10.1126/science.aad5497</pub-id><pub-id pub-id-type="pmid">26941318</pub-id></element-citation></ref><ref id="bib7"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Dan</surname><given-names>J</given-names></name><name><surname>Liu</surname><given-names>Y</given-names></name><name><surname>Liu</surname><given-names>N</given-names></name><name><surname>Chiourea</surname><given-names>M</given-names></name><name><surname>Okuka</surname><given-names>M</given-names></name><name><surname>Wu</surname><given-names>T</given-names></name><name><surname>Ye</surname><given-names>X</given-names></name><name><surname>Mou</surname><given-names>C</given-names></name><name><surname>Wang</surname><given-names>L</given-names></name><name><surname>Wang</surname><given-names>L</given-names></name><name><surname>Yin</surname><given-names>Y</given-names></name><name><surname>Yuan</surname><given-names>J</given-names></name><name><surname>Zuo</surname><given-names>B</given-names></name><name><surname>Wang</surname><given-names>F</given-names></name><name><surname>Li</surname><given-names>Z</given-names></name><name><surname>Pan</surname><given-names>X</given-names></name><name><surname>Yin</surname><given-names>Z</given-names></name><name><surname>Chen</surname><given-names>L</given-names></name><name><surname>Keefe</surname><given-names>DL</given-names></name><name><surname>Gagos</surname><given-names>S</given-names></name><name><surname>Xiao</surname><given-names>A</given-names></name><name><surname>Liu</surname><given-names>L</given-names></name></person-group><year>2014</year><article-title>Rif1 maintains telomere length homeostasis of ESCs by mediating heterochromatin silencing</article-title><source>Developmental Cell</source><volume>29</volume><fpage>7</fpage><lpage>19</lpage><pub-id pub-id-type="doi">10.1016/j.devcel.2014.03.004</pub-id><pub-id pub-id-type="pmid">24735877</pub-id></element-citation></ref><ref id="bib8"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>De Iaco</surname><given-names>A</given-names></name><name><surname>Planet</surname><given-names>E</given-names></name><name><surname>Coluccio</surname><given-names>A</given-names></name><name><surname>Verp</surname><given-names>S</given-names></name><name><surname>Duc</surname><given-names>J</given-names></name><name><surname>Trono</surname><given-names>D</given-names></name></person-group><year>2017</year><article-title>DUX-family transcription factors regulate zygotic genome activation in placental mammals</article-title><source>Nature Genetics</source><volume>49</volume><fpage>941</fpage><lpage>945</lpage><pub-id pub-id-type="doi">10.1038/ng.3858</pub-id><pub-id pub-id-type="pmid">28459456</pub-id></element-citation></ref><ref id="bib9"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Deniz</surname><given-names>&#x000d6;</given-names></name><name><surname>de la Rica</surname><given-names>L</given-names></name><name><surname>Cheng</surname><given-names>KCL</given-names></name><name><surname>Spensberger</surname><given-names>D</given-names></name><name><surname>Branco</surname><given-names>MR</given-names></name></person-group><year>2018</year><article-title>SETDB1 prevents TET2-dependent activation of IAP retroelements in na&#x000ef;ve embryonic stem cells</article-title><source>Genome Biology</source><volume>19</volume><elocation-id>6</elocation-id><pub-id pub-id-type="doi">10.1186/s13059-017-1376-y</pub-id><pub-id pub-id-type="pmid">29351814</pub-id></element-citation></ref><ref id="bib10"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Dewannieux</surname><given-names>M</given-names></name><name><surname>Heidmann</surname><given-names>T</given-names></name></person-group><year>2013</year><article-title>Endogenous retroviruses: acquisition, amplification and taming of genome invaders</article-title><source>Current Opinion in Virology</source><volume>3</volume><fpage>646</fpage><lpage>656</lpage><pub-id pub-id-type="doi">10.1016/j.coviro.2013.08.005</pub-id><pub-id pub-id-type="pmid">24004725</pub-id></element-citation></ref><ref id="bib11"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ecco</surname><given-names>G</given-names></name><name><surname>Cassano</surname><given-names>M</given-names></name><name><surname>Kauzlaric</surname><given-names>A</given-names></name><name><surname>Duc</surname><given-names>J</given-names></name><name><surname>Coluccio</surname><given-names>A</given-names></name><name><surname>Offner</surname><given-names>S</given-names></name><name><surname>Imbeault</surname><given-names>M</given-names></name><name><surname>Rowe</surname><given-names>HM</given-names></name><name><surname>Turelli</surname><given-names>P</given-names></name><name><surname>Trono</surname><given-names>D</given-names></name></person-group><year>2016</year><article-title>Transposable elements and their KRAB-ZFP controllers regulate gene expression in adult tissues</article-title><source>Developmental Cell</source><volume>36</volume><fpage>611</fpage><lpage>623</lpage><pub-id pub-id-type="doi">10.1016/j.devcel.2016.02.024</pub-id><pub-id pub-id-type="pmid">27003935</pub-id></element-citation></ref><ref id="bib12"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ecco</surname><given-names>G</given-names></name><name><surname>Imbeault</surname><given-names>M</given-names></name><name><surname>Trono</surname><given-names>D</given-names></name></person-group><year>2017</year><article-title>KRAB zinc finger proteins</article-title><source>Development</source><volume>144</volume><fpage>2719</fpage><lpage>2729</lpage><pub-id pub-id-type="doi">10.1242/dev.132605</pub-id><pub-id pub-id-type="pmid">28765213</pub-id></element-citation></ref><ref id="bib13"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Frank</surname><given-names>JA</given-names></name><name><surname>Feschotte</surname><given-names>C</given-names></name></person-group><year>2017</year><article-title>Co-option of endogenous viral sequences for host cell function</article-title><source>Current Opinion in Virology</source><volume>25</volume><fpage>81</fpage><lpage>89</lpage><pub-id pub-id-type="doi">10.1016/j.coviro.2017.07.021</pub-id><pub-id pub-id-type="pmid">28818736</pub-id></element-citation></ref><ref id="bib14"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Gagnier</surname><given-names>L</given-names></name><name><surname>Belancio</surname><given-names>VP</given-names></name><name><surname>Mager</surname><given-names>DL</given-names></name></person-group><year>2019</year><article-title>Mouse germ line mutations due to retrotransposon insertions</article-title><source>Mobile DNA</source><volume>10</volume><elocation-id>15</elocation-id><pub-id pub-id-type="doi">10.1186/s13100-019-0157-4</pub-id><pub-id pub-id-type="pmid">31011371</pub-id></element-citation></ref><ref id="bib15"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Groner</surname><given-names>AC</given-names></name><name><surname>Meylan</surname><given-names>S</given-names></name><name><surname>Ciuffi</surname><given-names>A</given-names></name><name><surname>Zangger</surname><given-names>N</given-names></name><name><surname>Ambrosini</surname><given-names>G</given-names></name><name><surname>D&#x000e9;nervaud</surname><given-names>N</given-names></name><name><surname>Bucher</surname><given-names>P</given-names></name><name><surname>Trono</surname><given-names>D</given-names></name></person-group><year>2010</year><article-title>KRAB-zinc finger proteins and KAP1 can mediate long-range transcriptional repression through heterochromatin spreading</article-title><source>PLOS Genetics</source><volume>6</volume><elocation-id>e1000869</elocation-id><pub-id pub-id-type="doi">10.1371/journal.pgen.1000869</pub-id><pub-id pub-id-type="pmid">20221260</pub-id></element-citation></ref><ref id="bib16"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hancks</surname><given-names>DC</given-names></name><name><surname>Kazazian</surname><given-names>HH</given-names></name></person-group><year>2016</year><article-title>Roles for retrotransposon insertions in human disease</article-title><source>Mobile DNA</source><volume>7</volume><elocation-id>9</elocation-id><pub-id pub-id-type="doi">10.1186/s13100-016-0065-9</pub-id><pub-id pub-id-type="pmid">27158268</pub-id></element-citation></ref><ref id="bib17"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Imbeault</surname><given-names>M</given-names></name><name><surname>Helleboid</surname><given-names>PY</given-names></name><name><surname>Trono</surname><given-names>D</given-names></name></person-group><year>2017</year><article-title>KRAB zinc-finger proteins contribute to the evolution of gene regulatory networks</article-title><source>Nature</source><volume>543</volume><fpage>550</fpage><lpage>554</lpage><pub-id pub-id-type="doi">10.1038/nature21683</pub-id><pub-id pub-id-type="pmid">28273063</pub-id></element-citation></ref><ref id="bib18"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Jacobs</surname><given-names>FM</given-names></name><name><surname>Greenberg</surname><given-names>D</given-names></name><name><surname>Nguyen</surname><given-names>N</given-names></name><name><surname>Haeussler</surname><given-names>M</given-names></name><name><surname>Ewing</surname><given-names>AD</given-names></name><name><surname>Katzman</surname><given-names>S</given-names></name><name><surname>Paten</surname><given-names>B</given-names></name><name><surname>Salama</surname><given-names>SR</given-names></name><name><surname>Haussler</surname><given-names>D</given-names></name></person-group><year>2014</year><article-title>An evolutionary arms race between KRAB zinc-finger genes ZNF91/93 and SVA/L1 retrotransposons</article-title><source>Nature</source><volume>516</volume><fpage>242</fpage><lpage>245</lpage><pub-id pub-id-type="doi">10.1038/nature13760</pub-id><pub-id pub-id-type="pmid">25274305</pub-id></element-citation></ref><ref id="bib19"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Kano</surname><given-names>H</given-names></name><name><surname>Kurahashi</surname><given-names>H</given-names></name><name><surname>Toda</surname><given-names>T</given-names></name></person-group><year>2007</year><article-title>Genetically regulated epigenetic transcriptional activation of retrotransposon insertion confers mouse dactylaplasia phenotype</article-title><source>PNAS</source><volume>104</volume><fpage>19034</fpage><lpage>19039</lpage><pub-id pub-id-type="doi">10.1073/pnas.0705483104</pub-id><pub-id pub-id-type="pmid">17984064</pub-id></element-citation></ref><ref id="bib20"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Karimi</surname><given-names>MM</given-names></name><name><surname>Goyal</surname><given-names>P</given-names></name><name><surname>Maksakova</surname><given-names>IA</given-names></name><name><surname>Bilenky</surname><given-names>M</given-names></name><name><surname>Leung</surname><given-names>D</given-names></name><name><surname>Tang</surname><given-names>JX</given-names></name><name><surname>Shinkai</surname><given-names>Y</given-names></name><name><surname>Mager</surname><given-names>DL</given-names></name><name><surname>Jones</surname><given-names>S</given-names></name><name><surname>Hirst</surname><given-names>M</given-names></name><name><surname>Lorincz</surname><given-names>MC</given-names></name></person-group><year>2011</year><article-title>DNA methylation and SETDB1/H3K9me3 regulate predominantly distinct sets of genes, retroelements, and chimeric transcripts in mESCs</article-title><source>Cell Stem Cell</source><volume>8</volume><fpage>676</fpage><lpage>687</lpage><pub-id pub-id-type="doi">10.1016/j.stem.2011.04.004</pub-id><pub-id pub-id-type="pmid">21624812</pub-id></element-citation></ref><ref id="bib21"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Kauzlaric</surname><given-names>A</given-names></name><name><surname>Ecco</surname><given-names>G</given-names></name><name><surname>Cassano</surname><given-names>M</given-names></name><name><surname>Duc</surname><given-names>J</given-names></name><name><surname>Imbeault</surname><given-names>M</given-names></name><name><surname>Trono</surname><given-names>D</given-names></name></person-group><year>2017</year><article-title>The mouse genome displays highly dynamic populations of KRAB-zinc finger protein genes and related genetic units</article-title><source>PLOS ONE</source><volume>12</volume><elocation-id>e0173746</elocation-id><pub-id pub-id-type="doi">10.1371/journal.pone.0173746</pub-id><pub-id pub-id-type="pmid">28334004</pub-id></element-citation></ref><ref id="bib22"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Khil</surname><given-names>PP</given-names></name><name><surname>Smagulova</surname><given-names>F</given-names></name><name><surname>Brick</surname><given-names>KM</given-names></name><name><surname>Camerini-Otero</surname><given-names>RD</given-names></name><name><surname>Petukhova</surname><given-names>GV</given-names></name></person-group><year>2012</year><article-title>Sensitive mapping of recombination hotspots using sequencing-based detection of ssDNA</article-title><source>Genome Research</source><volume>22</volume><fpage>957</fpage><lpage>965</lpage><pub-id pub-id-type="doi">10.1101/gr.130583.111</pub-id><pub-id pub-id-type="pmid">22367190</pub-id></element-citation></ref><ref id="bib23"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Krueger</surname><given-names>F</given-names></name><name><surname>Andrews</surname><given-names>SR</given-names></name></person-group><year>2011</year><article-title>Bismark: a flexible aligner and methylation caller for Bisulfite-Seq applications</article-title><source>Bioinformatics</source><volume>27</volume><fpage>1571</fpage><lpage>1572</lpage><pub-id pub-id-type="doi">10.1093/bioinformatics/btr167</pub-id><pub-id pub-id-type="pmid">21493656</pub-id></element-citation></ref><ref id="bib24"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Langmead</surname><given-names>B</given-names></name><name><surname>Salzberg</surname><given-names>SL</given-names></name></person-group><year>2012</year><article-title>Fast gapped-read alignment with bowtie 2</article-title><source>Nature Methods</source><volume>9</volume><fpage>357</fpage><lpage>359</lpage><pub-id pub-id-type="doi">10.1038/nmeth.1923</pub-id><pub-id pub-id-type="pmid">22388286</pub-id></element-citation></ref><ref id="bib25"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Legiewicz</surname><given-names>M</given-names></name><name><surname>Zolotukhin</surname><given-names>AS</given-names></name><name><surname>Pilkington</surname><given-names>GR</given-names></name><name><surname>Purzycka</surname><given-names>KJ</given-names></name><name><surname>Mitchell</surname><given-names>M</given-names></name><name><surname>Uranishi</surname><given-names>H</given-names></name><name><surname>Bear</surname><given-names>J</given-names></name><name><surname>Pavlakis</surname><given-names>GN</given-names></name><name><surname>Le Grice</surname><given-names>SF</given-names></name><name><surname>Felber</surname><given-names>BK</given-names></name></person-group><year>2010</year><article-title>The RNA transport element of the murine <italic>musD</italic> retrotransposon requires long-range intramolecular interactions for function</article-title><source>Journal of Biological Chemistry</source><volume>285</volume><fpage>42097</fpage><lpage>42104</lpage><pub-id pub-id-type="doi">10.1074/jbc.M110.182840</pub-id><pub-id pub-id-type="pmid">20978285</pub-id></element-citation></ref><ref id="bib26"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lehoczky</surname><given-names>JA</given-names></name><name><surname>Thomas</surname><given-names>PE</given-names></name><name><surname>Patrie</surname><given-names>KM</given-names></name><name><surname>Owens</surname><given-names>KM</given-names></name><name><surname>Villarreal</surname><given-names>LM</given-names></name><name><surname>Galbraith</surname><given-names>K</given-names></name><name><surname>Washburn</surname><given-names>J</given-names></name><name><surname>Johnson</surname><given-names>CN</given-names></name><name><surname>Gavino</surname><given-names>B</given-names></name><name><surname>Borowsky</surname><given-names>AD</given-names></name><name><surname>Millen</surname><given-names>KJ</given-names></name><name><surname>Wakenight</surname><given-names>P</given-names></name><name><surname>Law</surname><given-names>W</given-names></name><name><surname>Van Keuren</surname><given-names>ML</given-names></name><name><surname>Gavrilina</surname><given-names>G</given-names></name><name><surname>Hughes</surname><given-names>ED</given-names></name><name><surname>Saunders</surname><given-names>TL</given-names></name><name><surname>Brihn</surname><given-names>L</given-names></name><name><surname>Nadeau</surname><given-names>JH</given-names></name><name><surname>Innis</surname><given-names>JW</given-names></name></person-group><year>2013</year><article-title>A novel intergenic ETnII-&#x003b2; insertion mutation causes multiple malformations in Polypodia mice</article-title><source>PLOS Genetics</source><volume>9</volume><elocation-id>e1003967</elocation-id><pub-id pub-id-type="doi">10.1371/journal.pgen.1003967</pub-id><pub-id pub-id-type="pmid">24339789</pub-id></element-citation></ref><ref id="bib27"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Leung</surname><given-names>D</given-names></name><name><surname>Du</surname><given-names>T</given-names></name><name><surname>Wagner</surname><given-names>U</given-names></name><name><surname>Xie</surname><given-names>W</given-names></name><name><surname>Lee</surname><given-names>AY</given-names></name><name><surname>Goyal</surname><given-names>P</given-names></name><name><surname>Li</surname><given-names>Y</given-names></name><name><surname>Szulwach</surname><given-names>KE</given-names></name><name><surname>Jin</surname><given-names>P</given-names></name><name><surname>Lorincz</surname><given-names>MC</given-names></name><name><surname>Ren</surname><given-names>B</given-names></name></person-group><year>2014</year><article-title>Regulation of DNA methylation turnover at LTR retrotransposons and imprinted loci by the histone methyltransferase Setdb1</article-title><source>PNAS</source><volume>111</volume><fpage>6690</fpage><lpage>6695</lpage><pub-id pub-id-type="doi">10.1073/pnas.1322273111</pub-id><pub-id pub-id-type="pmid">24757056</pub-id></element-citation></ref><ref id="bib28"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lilue</surname><given-names>J</given-names></name><name><surname>Doran</surname><given-names>AG</given-names></name><name><surname>Fiddes</surname><given-names>IT</given-names></name><name><surname>Abrudan</surname><given-names>M</given-names></name><name><surname>Armstrong</surname><given-names>J</given-names></name><name><surname>Bennett</surname><given-names>R</given-names></name><name><surname>Chow</surname><given-names>W</given-names></name><name><surname>Collins</surname><given-names>J</given-names></name><name><surname>Collins</surname><given-names>S</given-names></name><name><surname>Czechanski</surname><given-names>A</given-names></name><name><surname>Danecek</surname><given-names>P</given-names></name><name><surname>Diekhans</surname><given-names>M</given-names></name><name><surname>Dolle</surname><given-names>DD</given-names></name><name><surname>Dunn</surname><given-names>M</given-names></name><name><surname>Durbin</surname><given-names>R</given-names></name><name><surname>Earl</surname><given-names>D</given-names></name><name><surname>Ferguson-Smith</surname><given-names>A</given-names></name><name><surname>Flicek</surname><given-names>P</given-names></name><name><surname>Flint</surname><given-names>J</given-names></name><name><surname>Frankish</surname><given-names>A</given-names></name><name><surname>Fu</surname><given-names>B</given-names></name><name><surname>Gerstein</surname><given-names>M</given-names></name><name><surname>Gilbert</surname><given-names>J</given-names></name><name><surname>Goodstadt</surname><given-names>L</given-names></name><name><surname>Harrow</surname><given-names>J</given-names></name><name><surname>Howe</surname><given-names>K</given-names></name><name><surname>Ibarra-Soria</surname><given-names>X</given-names></name><name><surname>Kolmogorov</surname><given-names>M</given-names></name><name><surname>Lelliott</surname><given-names>CJ</given-names></name><name><surname>Logan</surname><given-names>DW</given-names></name><name><surname>Loveland</surname><given-names>J</given-names></name><name><surname>Mathews</surname><given-names>CE</given-names></name><name><surname>Mott</surname><given-names>R</given-names></name><name><surname>Muir</surname><given-names>P</given-names></name><name><surname>Nachtweide</surname><given-names>S</given-names></name><name><surname>Navarro</surname><given-names>FCP</given-names></name><name><surname>Odom</surname><given-names>DT</given-names></name><name><surname>Park</surname><given-names>N</given-names></name><name><surname>Pelan</surname><given-names>S</given-names></name><name><surname>Pham</surname><given-names>SK</given-names></name><name><surname>Quail</surname><given-names>M</given-names></name><name><surname>Reinholdt</surname><given-names>L</given-names></name><name><surname>Romoth</surname><given-names>L</given-names></name><name><surname>Shirley</surname><given-names>L</given-names></name><name><surname>Sisu</surname><given-names>C</given-names></name><name><surname>Sjoberg-Herrera</surname><given-names>M</given-names></name><name><surname>Stanke</surname><given-names>M</given-names></name><name><surname>Steward</surname><given-names>C</given-names></name><name><surname>Thomas</surname><given-names>M</given-names></name><name><surname>Threadgold</surname><given-names>G</given-names></name><name><surname>Thybert</surname><given-names>D</given-names></name><name><surname>Torrance</surname><given-names>J</given-names></name><name><surname>Wong</surname><given-names>K</given-names></name><name><surname>Wood</surname><given-names>J</given-names></name><name><surname>Yalcin</surname><given-names>B</given-names></name><name><surname>Yang</surname><given-names>F</given-names></name><name><surname>Adams</surname><given-names>DJ</given-names></name><name><surname>Paten</surname><given-names>B</given-names></name><name><surname>Keane</surname><given-names>TM</given-names></name></person-group><year>2018</year><article-title>Sixteen diverse laboratory mouse reference genomes define strain-specific haplotypes and novel functional loci</article-title><source>Nature Genetics</source><volume>50</volume><fpage>1574</fpage><lpage>1583</lpage><pub-id pub-id-type="doi">10.1038/s41588-018-0223-8</pub-id><pub-id pub-id-type="pmid">30275530</pub-id></element-citation></ref><ref id="bib29"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname><given-names>S</given-names></name><name><surname>Brind'Amour</surname><given-names>J</given-names></name><name><surname>Karimi</surname><given-names>MM</given-names></name><name><surname>Shirane</surname><given-names>K</given-names></name><name><surname>Bogutz</surname><given-names>A</given-names></name><name><surname>Lefebvre</surname><given-names>L</given-names></name><name><surname>Sasaki</surname><given-names>H</given-names></name><name><surname>Shinkai</surname><given-names>Y</given-names></name><name><surname>Lorincz</surname><given-names>MC</given-names></name></person-group><year>2014</year><article-title><italic>Setdb1</italic> is required for germline development and silencing of H3K9me3-marked endogenous retroviruses in primordial germ cells</article-title><source>Genes &#x00026; Development</source><volume>28</volume><fpage>2041</fpage><lpage>2055</lpage><pub-id pub-id-type="doi">10.1101/gad.244848.114</pub-id><pub-id pub-id-type="pmid">25228647</pub-id></element-citation></ref><ref id="bib30"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Love</surname><given-names>MI</given-names></name><name><surname>Huber</surname><given-names>W</given-names></name><name><surname>Anders</surname><given-names>S</given-names></name></person-group><year>2014</year><article-title>Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2</article-title><source>Genome Biology</source><volume>15</volume><elocation-id>550</elocation-id><pub-id pub-id-type="doi">10.1186/s13059-014-0550-8</pub-id><pub-id pub-id-type="pmid">25516281</pub-id></element-citation></ref><ref id="bib31"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lugani</surname><given-names>F</given-names></name><name><surname>Arora</surname><given-names>R</given-names></name><name><surname>Papeta</surname><given-names>N</given-names></name><name><surname>Patel</surname><given-names>A</given-names></name><name><surname>Zheng</surname><given-names>Z</given-names></name><name><surname>Sterken</surname><given-names>R</given-names></name><name><surname>Singer</surname><given-names>RA</given-names></name><name><surname>Caridi</surname><given-names>G</given-names></name><name><surname>Mendelsohn</surname><given-names>C</given-names></name><name><surname>Sussel</surname><given-names>L</given-names></name><name><surname>Papaioannou</surname><given-names>VE</given-names></name><name><surname>Gharavi</surname><given-names>AG</given-names></name></person-group><year>2013</year><article-title>A retrotransposon insertion in the 5' regulatory domain of Ptf1a results in ectopic gene expression and multiple congenital defects in Danforth's short tail mouse</article-title><source>PLOS Genetics</source><volume>9</volume><elocation-id>e1003206</elocation-id><pub-id pub-id-type="doi">10.1371/journal.pgen.1003206</pub-id><pub-id pub-id-type="pmid">23437001</pub-id></element-citation></ref><ref id="bib32"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Macfarlan</surname><given-names>TS</given-names></name><name><surname>Gifford</surname><given-names>WD</given-names></name><name><surname>Driscoll</surname><given-names>S</given-names></name><name><surname>Lettieri</surname><given-names>K</given-names></name><name><surname>Rowe</surname><given-names>HM</given-names></name><name><surname>Bonanomi</surname><given-names>D</given-names></name><name><surname>Firth</surname><given-names>A</given-names></name><name><surname>Singer</surname><given-names>O</given-names></name><name><surname>Trono</surname><given-names>D</given-names></name><name><surname>Pfaff</surname><given-names>SL</given-names></name></person-group><year>2012</year><article-title>Embryonic stem cell potency fluctuates with endogenous retrovirus activity</article-title><source>Nature</source><volume>487</volume><fpage>57</fpage><lpage>63</lpage><pub-id pub-id-type="doi">10.1038/nature11244</pub-id><pub-id pub-id-type="pmid">22722858</pub-id></element-citation></ref><ref id="bib33"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Maksakova</surname><given-names>IA</given-names></name><name><surname>Romanish</surname><given-names>MT</given-names></name><name><surname>Gagnier</surname><given-names>L</given-names></name><name><surname>Dunn</surname><given-names>CA</given-names></name><name><surname>van de Lagemaat</surname><given-names>LN</given-names></name><name><surname>Mager</surname><given-names>DL</given-names></name></person-group><year>2006</year><article-title>Retroviral elements and their hosts: insertional mutagenesis in the mouse germ line</article-title><source>PLOS Genetics</source><volume>2</volume><elocation-id>e2</elocation-id><pub-id pub-id-type="doi">10.1371/journal.pgen.0020002</pub-id><pub-id pub-id-type="pmid">16440055</pub-id></element-citation></ref><ref id="bib34"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Matsui</surname><given-names>T</given-names></name><name><surname>Leung</surname><given-names>D</given-names></name><name><surname>Miyashita</surname><given-names>H</given-names></name><name><surname>Maksakova</surname><given-names>IA</given-names></name><name><surname>Miyachi</surname><given-names>H</given-names></name><name><surname>Kimura</surname><given-names>H</given-names></name><name><surname>Tachibana</surname><given-names>M</given-names></name><name><surname>Lorincz</surname><given-names>MC</given-names></name><name><surname>Shinkai</surname><given-names>Y</given-names></name></person-group><year>2010</year><article-title>Proviral silencing in embryonic stem cells requires the histone methyltransferase ESET</article-title><source>Nature</source><volume>464</volume><fpage>927</fpage><lpage>931</lpage><pub-id pub-id-type="doi">10.1038/nature08858</pub-id><pub-id pub-id-type="pmid">20164836</pub-id></element-citation></ref><ref id="bib35"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Najafabadi</surname><given-names>HS</given-names></name><name><surname>Mnaimneh</surname><given-names>S</given-names></name><name><surname>Schmitges</surname><given-names>FW</given-names></name><name><surname>Garton</surname><given-names>M</given-names></name><name><surname>Lam</surname><given-names>KN</given-names></name><name><surname>Yang</surname><given-names>A</given-names></name><name><surname>Albu</surname><given-names>M</given-names></name><name><surname>Weirauch</surname><given-names>MT</given-names></name><name><surname>Radovani</surname><given-names>E</given-names></name><name><surname>Kim</surname><given-names>PM</given-names></name><name><surname>Greenblatt</surname><given-names>J</given-names></name><name><surname>Frey</surname><given-names>BJ</given-names></name><name><surname>Hughes</surname><given-names>TR</given-names></name></person-group><year>2015</year><article-title>C2H2 zinc finger proteins greatly expand the human regulatory lexicon</article-title><source>Nature Biotechnology</source><volume>33</volume><fpage>555</fpage><lpage>562</lpage><pub-id pub-id-type="doi">10.1038/nbt.3128</pub-id><pub-id pub-id-type="pmid">25690854</pub-id></element-citation></ref><ref id="bib36"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Nell&#x000e5;ker</surname><given-names>C</given-names></name><name><surname>Keane</surname><given-names>TM</given-names></name><name><surname>Yalcin</surname><given-names>B</given-names></name><name><surname>Wong</surname><given-names>K</given-names></name><name><surname>Agam</surname><given-names>A</given-names></name><name><surname>Belgard</surname><given-names>TG</given-names></name><name><surname>Flint</surname><given-names>J</given-names></name><name><surname>Adams</surname><given-names>DJ</given-names></name><name><surname>Frankel</surname><given-names>WN</given-names></name><name><surname>Ponting</surname><given-names>CP</given-names></name></person-group><year>2012</year><article-title>The genomic landscape shaped by selection on transposable elements across 18 mouse strains</article-title><source>Genome Biology</source><volume>13</volume><elocation-id>R45</elocation-id><pub-id pub-id-type="doi">10.1186/gb-2012-13-6-r45</pub-id><pub-id pub-id-type="pmid">22703977</pub-id></element-citation></ref><ref id="bib37"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>O'Geen</surname><given-names>H</given-names></name><name><surname>Frietze</surname><given-names>S</given-names></name><name><surname>Farnham</surname><given-names>PJ</given-names></name></person-group><year>2010</year><article-title>Using ChIP-seq technology to identify targets of zinc finger transcription factors</article-title><source>Methods in Molecular Biology</source><volume>649</volume><fpage>437</fpage><lpage>455</lpage><pub-id pub-id-type="doi">10.1007/978-1-60761-753-2_27</pub-id><pub-id pub-id-type="pmid">20680851</pub-id></element-citation></ref><ref id="bib38"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Patel</surname><given-names>A</given-names></name><name><surname>Yang</surname><given-names>P</given-names></name><name><surname>Tinkham</surname><given-names>M</given-names></name><name><surname>Pradhan</surname><given-names>M</given-names></name><name><surname>Sun</surname><given-names>M-A</given-names></name><name><surname>Wang</surname><given-names>Y</given-names></name><name><surname>Hoang</surname><given-names>D</given-names></name><name><surname>Wolf</surname><given-names>G</given-names></name><name><surname>Horton</surname><given-names>JR</given-names></name><name><surname>Zhang</surname><given-names>X</given-names></name><name><surname>Macfarlan</surname><given-names>T</given-names></name><name><surname>Cheng</surname><given-names>X</given-names></name></person-group><year>2018</year><article-title>DNA conformation induces adaptable binding by tandem zinc finger proteins</article-title><source>Cell</source><volume>173</volume><fpage>221</fpage><lpage>233</lpage><pub-id pub-id-type="doi">10.1016/j.cell.2018.02.058</pub-id><pub-id pub-id-type="pmid">29551271</pub-id></element-citation></ref><ref id="bib39"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ribet</surname><given-names>D</given-names></name><name><surname>Dewannieux</surname><given-names>M</given-names></name><name><surname>Heidmann</surname><given-names>T</given-names></name></person-group><year>2004</year><article-title>An active murine transposon family pair: retrotransposition of "master" MusD copies and ETn trans-mobilization</article-title><source>Genome Research</source><volume>14</volume><fpage>2261</fpage><lpage>2267</lpage><pub-id pub-id-type="doi">10.1101/gr.2924904</pub-id><pub-id pub-id-type="pmid">15479948</pub-id></element-citation></ref><ref id="bib40"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Richardson</surname><given-names>SR</given-names></name><name><surname>Gerdes</surname><given-names>P</given-names></name><name><surname>Gerhardt</surname><given-names>DJ</given-names></name><name><surname>Sanchez-Luque</surname><given-names>FJ</given-names></name><name><surname>Bodea</surname><given-names>GO</given-names></name><name><surname>Mu&#x000f1;oz-Lopez</surname><given-names>M</given-names></name><name><surname>Jesuadian</surname><given-names>JS</given-names></name><name><surname>Kempen</surname><given-names>MHC</given-names></name><name><surname>Carreira</surname><given-names>PE</given-names></name><name><surname>Jeddeloh</surname><given-names>JA</given-names></name><name><surname>Garcia-Perez</surname><given-names>JL</given-names></name><name><surname>Kazazian</surname><given-names>HH</given-names></name><name><surname>Ewing</surname><given-names>AD</given-names></name><name><surname>Faulkner</surname><given-names>GJ</given-names></name></person-group><year>2017</year><article-title>Heritable L1 retrotransposition in the mouse primordial germline and early embryo</article-title><source>Genome Research</source><volume>27</volume><fpage>1395</fpage><lpage>1405</lpage><pub-id pub-id-type="doi">10.1101/gr.219022.116</pub-id><pub-id pub-id-type="pmid">28483779</pub-id></element-citation></ref><ref id="bib41"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Rowe</surname><given-names>HM</given-names></name><name><surname>Jakobsson</surname><given-names>J</given-names></name><name><surname>Mesnard</surname><given-names>D</given-names></name><name><surname>Rougemont</surname><given-names>J</given-names></name><name><surname>Reynard</surname><given-names>S</given-names></name><name><surname>Aktas</surname><given-names>T</given-names></name><name><surname>Maillard</surname><given-names>PV</given-names></name><name><surname>Layard-Liesching</surname><given-names>H</given-names></name><name><surname>Verp</surname><given-names>S</given-names></name><name><surname>Marquis</surname><given-names>J</given-names></name><name><surname>Spitz</surname><given-names>F</given-names></name><name><surname>Constam</surname><given-names>DB</given-names></name><name><surname>Trono</surname><given-names>D</given-names></name></person-group><year>2010</year><article-title>KAP1 controls endogenous retroviruses in embryonic stem cells</article-title><source>Nature</source><volume>463</volume><fpage>237</fpage><lpage>240</lpage><pub-id pub-id-type="doi">10.1038/nature08674</pub-id><pub-id pub-id-type="pmid">20075919</pub-id></element-citation></ref><ref id="bib42"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Rowe</surname><given-names>HM</given-names></name><name><surname>Kapopoulou</surname><given-names>A</given-names></name><name><surname>Corsinotti</surname><given-names>A</given-names></name><name><surname>Fasching</surname><given-names>L</given-names></name><name><surname>Macfarlan</surname><given-names>TS</given-names></name><name><surname>Tarabay</surname><given-names>Y</given-names></name><name><surname>Viville</surname><given-names>S</given-names></name><name><surname>Jakobsson</surname><given-names>J</given-names></name><name><surname>Pfaff</surname><given-names>SL</given-names></name><name><surname>Trono</surname><given-names>D</given-names></name></person-group><year>2013</year><article-title>TRIM28 repression of retrotransposon-based enhancers is necessary to preserve transcriptional dynamics in embryonic stem cells</article-title><source>Genome Research</source><volume>23</volume><fpage>452</fpage><lpage>461</lpage><pub-id pub-id-type="doi">10.1101/gr.147678.112</pub-id><pub-id pub-id-type="pmid">23233547</pub-id></element-citation></ref><ref id="bib43"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Schauer</surname><given-names>SN</given-names></name><name><surname>Carreira</surname><given-names>PE</given-names></name><name><surname>Shukla</surname><given-names>R</given-names></name><name><surname>Gerhardt</surname><given-names>DJ</given-names></name><name><surname>Gerdes</surname><given-names>P</given-names></name><name><surname>Sanchez-Luque</surname><given-names>FJ</given-names></name><name><surname>Nicoli</surname><given-names>P</given-names></name><name><surname>Kindlova</surname><given-names>M</given-names></name><name><surname>Ghisletti</surname><given-names>S</given-names></name><name><surname>Santos</surname><given-names>AD</given-names></name><name><surname>Rapoud</surname><given-names>D</given-names></name><name><surname>Samuel</surname><given-names>D</given-names></name><name><surname>Faivre</surname><given-names>J</given-names></name><name><surname>Ewing</surname><given-names>AD</given-names></name><name><surname>Richardson</surname><given-names>SR</given-names></name><name><surname>Faulkner</surname><given-names>GJ</given-names></name></person-group><year>2018</year><article-title>L1 retrotransposition is a common feature of mammalian hepatocarcinogenesis</article-title><source>Genome Research</source><volume>28</volume><fpage>639</fpage><lpage>653</lpage><pub-id pub-id-type="doi">10.1101/gr.226993.117</pub-id><pub-id pub-id-type="pmid">29643204</pub-id></element-citation></ref><ref id="bib44"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Schultz</surname><given-names>DC</given-names></name><name><surname>Ayyanathan</surname><given-names>K</given-names></name><name><surname>Negorev</surname><given-names>D</given-names></name><name><surname>Maul</surname><given-names>GG</given-names></name><name><surname>Rauscher</surname><given-names>FJ</given-names></name></person-group><year>2002</year><article-title>SETDB1: a novel KAP-1-associated histone H3, lysine 9-specific methyltransferase that contributes to HP1-mediated silencing of euchromatic genes by KRAB zinc-finger proteins</article-title><source>Genes &#x00026; Development</source><volume>16</volume><fpage>919</fpage><lpage>932</lpage><pub-id pub-id-type="doi">10.1101/gad.973302</pub-id><pub-id pub-id-type="pmid">11959841</pub-id></element-citation></ref><ref id="bib45"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Semba</surname><given-names>K</given-names></name><name><surname>Araki</surname><given-names>K</given-names></name><name><surname>Matsumoto</surname><given-names>K</given-names></name><name><surname>Suda</surname><given-names>H</given-names></name><name><surname>Ando</surname><given-names>T</given-names></name><name><surname>Sei</surname><given-names>A</given-names></name><name><surname>Mizuta</surname><given-names>H</given-names></name><name><surname>Takagi</surname><given-names>K</given-names></name><name><surname>Nakahara</surname><given-names>M</given-names></name><name><surname>Muta</surname><given-names>M</given-names></name><name><surname>Yamada</surname><given-names>G</given-names></name><name><surname>Nakagata</surname><given-names>N</given-names></name><name><surname>Iida</surname><given-names>A</given-names></name><name><surname>Ikegawa</surname><given-names>S</given-names></name><name><surname>Nakamura</surname><given-names>Y</given-names></name><name><surname>Araki</surname><given-names>M</given-names></name><name><surname>Abe</surname><given-names>K</given-names></name><name><surname>Yamamura</surname><given-names>K</given-names></name></person-group><year>2013</year><article-title>Ectopic expression of Ptf1a induces spinal defects, urogenital defects, and anorectal malformations in Danforth's short tail mice</article-title><source>PLOS Genetics</source><volume>9</volume><elocation-id>e1003204</elocation-id><pub-id pub-id-type="doi">10.1371/journal.pgen.1003204</pub-id><pub-id pub-id-type="pmid">23436999</pub-id></element-citation></ref><ref id="bib46"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Sripathy</surname><given-names>SP</given-names></name><name><surname>Stevens</surname><given-names>J</given-names></name><name><surname>Schultz</surname><given-names>DC</given-names></name></person-group><year>2006</year><article-title>The KAP1 corepressor functions to coordinate the assembly of de novo HP1-demarcated microenvironments of heterochromatin required for KRAB zinc finger protein-mediated transcriptional repression</article-title><source>Molecular and Cellular Biology</source><volume>26</volume><fpage>8623</fpage><lpage>8638</lpage><pub-id pub-id-type="doi">10.1128/MCB.00487-06</pub-id><pub-id pub-id-type="pmid">16954381</pub-id></element-citation></ref><ref id="bib47"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Thomas</surname><given-names>JH</given-names></name><name><surname>Schneider</surname><given-names>S</given-names></name></person-group><year>2011</year><article-title>Coevolution of retroelements and tandem zinc finger genes</article-title><source>Genome Research</source><volume>21</volume><fpage>1800</fpage><lpage>1812</lpage><pub-id pub-id-type="doi">10.1101/gr.121749.111</pub-id><pub-id pub-id-type="pmid">21784874</pub-id></element-citation></ref><ref id="bib48"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Thompson</surname><given-names>PJ</given-names></name><name><surname>Macfarlan</surname><given-names>TS</given-names></name><name><surname>Lorincz</surname><given-names>MC</given-names></name></person-group><year>2016</year><article-title>Long terminal repeats: from parasitic elements to building blocks of the transcriptional regulatory repertoire</article-title><source>Molecular Cell</source><volume>62</volume><fpage>766</fpage><lpage>776</lpage><pub-id pub-id-type="doi">10.1016/j.molcel.2016.03.029</pub-id><pub-id pub-id-type="pmid">27259207</pub-id></element-citation></ref><ref id="bib49"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Treger</surname><given-names>RS</given-names></name><name><surname>Pope</surname><given-names>SD</given-names></name><name><surname>Kong</surname><given-names>Y</given-names></name><name><surname>Tokuyama</surname><given-names>M</given-names></name><name><surname>Taura</surname><given-names>M</given-names></name><name><surname>Iwasaki</surname><given-names>A</given-names></name></person-group><year>2019</year><article-title>The lupus susceptibility locus Sgp3 encodes the suppressor of endogenous retrovirus expression SNERV</article-title><source>Immunity</source><volume>50</volume><fpage>334</fpage><lpage>347</lpage><pub-id pub-id-type="doi">10.1016/j.immuni.2018.12.022</pub-id><pub-id pub-id-type="pmid">30709743</pub-id></element-citation></ref><ref id="bib50"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Vlangos</surname><given-names>CN</given-names></name><name><surname>Siuniak</surname><given-names>AN</given-names></name><name><surname>Robinson</surname><given-names>D</given-names></name><name><surname>Chinnaiyan</surname><given-names>AM</given-names></name><name><surname>Lyons</surname><given-names>RH</given-names></name><name><surname>Cavalcoli</surname><given-names>JD</given-names></name><name><surname>Keegan</surname><given-names>CE</given-names></name></person-group><year>2013</year><article-title>Next-generation sequencing identifies the Danforth's short tail mouse mutation as a retrotransposon insertion affecting Ptf1a expression</article-title><source>PLOS Genetics</source><volume>9</volume><elocation-id>e1003205</elocation-id><pub-id pub-id-type="doi">10.1371/journal.pgen.1003205</pub-id><pub-id pub-id-type="pmid">23437000</pub-id></element-citation></ref><ref id="bib51"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname><given-names>J</given-names></name><name><surname>Xie</surname><given-names>G</given-names></name><name><surname>Singh</surname><given-names>M</given-names></name><name><surname>Ghanbarian</surname><given-names>AT</given-names></name><name><surname>Rask&#x000f3;</surname><given-names>T</given-names></name><name><surname>Szvetnik</surname><given-names>A</given-names></name><name><surname>Cai</surname><given-names>H</given-names></name><name><surname>Besser</surname><given-names>D</given-names></name><name><surname>Prigione</surname><given-names>A</given-names></name><name><surname>Fuchs</surname><given-names>NV</given-names></name><name><surname>Schumann</surname><given-names>GG</given-names></name><name><surname>Chen</surname><given-names>W</given-names></name><name><surname>Lorincz</surname><given-names>MC</given-names></name><name><surname>Ivics</surname><given-names>Z</given-names></name><name><surname>Hurst</surname><given-names>LD</given-names></name><name><surname>Izsv&#x000e1;k</surname><given-names>Z</given-names></name></person-group><year>2014</year><article-title>Primate-specific endogenous retrovirus-driven transcription defines naive-like stem cells</article-title><source>Nature</source><volume>516</volume><fpage>405</fpage><lpage>409</lpage><pub-id pub-id-type="doi">10.1038/nature13804</pub-id><pub-id pub-id-type="pmid">25317556</pub-id></element-citation></ref><ref id="bib52"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Wolf</surname><given-names>D</given-names></name><name><surname>Hug</surname><given-names>K</given-names></name><name><surname>Goff</surname><given-names>SP</given-names></name></person-group><year>2008</year><article-title>TRIM28 mediates primer binding site-targeted silencing of Lys1,2 tRNA-utilizing retroviruses in embryonic cells</article-title><source>PNAS</source><volume>105</volume><fpage>12521</fpage><lpage>12526</lpage><pub-id pub-id-type="doi">10.1073/pnas.0805540105</pub-id><pub-id pub-id-type="pmid">18713861</pub-id></element-citation></ref><ref id="bib53"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Wolf</surname><given-names>G</given-names></name><name><surname>Greenberg</surname><given-names>D</given-names></name><name><surname>Macfarlan</surname><given-names>TS</given-names></name></person-group><year>2015a</year><article-title>Spotting the enemy within: targeted silencing of foreign DNA in mammalian genomes by the Kr&#x000fc;ppel-associated box zinc finger protein family</article-title><source>Mobile DNA</source><volume>6</volume><elocation-id>17</elocation-id><pub-id pub-id-type="doi">10.1186/s13100-015-0050-8</pub-id><pub-id pub-id-type="pmid">26435754</pub-id></element-citation></ref><ref id="bib54"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Wolf</surname><given-names>G</given-names></name><name><surname>Yang</surname><given-names>P</given-names></name><name><surname>F&#x000fc;chtbauer</surname><given-names>AC</given-names></name><name><surname>F&#x000fc;chtbauer</surname><given-names>EM</given-names></name><name><surname>Silva</surname><given-names>AM</given-names></name><name><surname>Park</surname><given-names>C</given-names></name><name><surname>Wu</surname><given-names>W</given-names></name><name><surname>Nielsen</surname><given-names>AL</given-names></name><name><surname>Pedersen</surname><given-names>FS</given-names></name><name><surname>Macfarlan</surname><given-names>TS</given-names></name></person-group><year>2015b</year><article-title>The KRAB zinc finger protein ZFP809 is required to initiate epigenetic silencing of endogenous retroviruses</article-title><source>Genes &#x00026; Development</source><volume>29</volume><fpage>538</fpage><lpage>554</lpage><pub-id pub-id-type="doi">10.1101/gad.252767.114</pub-id><pub-id pub-id-type="pmid">25737282</pub-id></element-citation></ref><ref id="bib55"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Yamauchi</surname><given-names>M</given-names></name><name><surname>Freitag</surname><given-names>B</given-names></name><name><surname>Khan</surname><given-names>C</given-names></name><name><surname>Berwin</surname><given-names>B</given-names></name><name><surname>Barklis</surname><given-names>E</given-names></name></person-group><year>1995</year><article-title>Stem cell factor binding to retrovirus primer binding site silencers</article-title><source>Journal of Virology</source><volume>69</volume><fpage>1142</fpage><lpage>1149</lpage><pub-id pub-id-type="doi">10.1128/JVI.69.2.1142-1149.1995</pub-id><pub-id pub-id-type="pmid">7529329</pub-id></element-citation></ref><ref id="bib56"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname><given-names>Y</given-names></name><name><surname>Liu</surname><given-names>T</given-names></name><name><surname>Meyer</surname><given-names>CA</given-names></name><name><surname>Eeckhoute</surname><given-names>J</given-names></name><name><surname>Johnson</surname><given-names>DS</given-names></name><name><surname>Bernstein</surname><given-names>BE</given-names></name><name><surname>Nusbaum</surname><given-names>C</given-names></name><name><surname>Myers</surname><given-names>RM</given-names></name><name><surname>Brown</surname><given-names>M</given-names></name><name><surname>Li</surname><given-names>W</given-names></name><name><surname>Liu</surname><given-names>XS</given-names></name></person-group><year>2008</year><article-title>Model-based analysis of ChIP-Seq (MACS)</article-title><source>Genome Biology</source><volume>9</volume><elocation-id>R137</elocation-id><pub-id pub-id-type="doi">10.1186/gb-2008-9-9-r137</pub-id><pub-id pub-id-type="pmid">18798982</pub-id></element-citation></ref></ref-list></back><sub-article id="sa1" article-type="decision-letter"><front-stub><article-id pub-id-type="doi">10.7554/eLife.56337.sa1</article-id><title-group><article-title>Decision letter</article-title></title-group><contrib-group><contrib contrib-type="editor"><name><surname>Bourc'his</surname><given-names>Deborah</given-names></name><role>Reviewing Editor</role><aff><institution>Institut Curie</institution><country>France</country></aff></contrib></contrib-group><contrib-group><contrib contrib-type="reviewer"><name><surname>Bourc'his</surname><given-names>Deborah</given-names></name><role>Reviewer</role><aff><institution>Institut Curie</institution><country>France</country></aff></contrib><contrib contrib-type="reviewer"><name><surname>Barau</surname><given-names>Joan</given-names></name><role>Reviewer</role><aff><institution>IMB Mainz</institution><country>Germany</country></aff></contrib></contrib-group></front-stub><body><boxed-text position="float" orientation="portrait"><p>In the interests of transparency, eLife publishes the most substantive revision requests and the accompanying author responses.</p></boxed-text><p><bold>Acceptance summary:</bold></p><p>This is the first study that addresses in vivo the targets and function of genomic clusters of KRAB-ZFP-encoding genes. This family of proteins reveals the complexity, redundancy and specificity (of targets and of cellular type of expression) of the evolutionary processes that control ever adapting retrotransposons.</p><p><bold>Decision letter after peer review:</bold></p><p>Thank you for submitting your article "KRAB-zinc finger protein gene expansion in response to active retrotransposons in the murine lineage" for consideration by <italic>eLife</italic>. Your article has been reviewed by three peer reviewers, including Deborah Bourc'his as the Reviewing Editor and Reviewer #1, and the evaluation has been overseen by George Perry as the Senior Editor. The following individual involved in review of your submission has agreed to reveal their identity: Joan Barau (Reviewer #3).</p><p>The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.</p><p>Summary:</p><p>Restriction of transposable elements relies on multiple pathways, among which KRAB-ZFP proteins play a key role through their diversity and fast evolvability. The repertoire of KRAB-ZFPs is highly species-specific, likely driven by the pressure to adapt to host retrotransposon load, although they can also be repurposed for other regulatory functions over evolutionary time. Their functional study is challenging, because of their redundant and complementary functions, and also because of their sequence similarity and common organization in large genomic clusters.</p><p>In this study, the authors probed the genomic targets and roles of a large set of KRAB-ZFPs that are specifically expressed in pluripotent cells and that have recently arisen in the mouse genome, which contains a large fraction of retrotransposons of LTR and non-LTR classes that are still potentially mobile. Using systematic expression of epitope-tagged KRAB-ZFP transgenes and deletions of large clusters in both embryonic stem cells and in mice, the authors demonstrate that: 1) mouse KRAB-ZFPs primarily bind to retrotransposons, 2) large KRAB-ZFP gene clusters are required for transcriptional repression of evolutionarily young retrotransposons (LINE, IAP, and ETn) in pluripotent cells, 3) large pluripotency-associated KRAB-ZFP gene clusters are mostly dispensable for viability and development, and 4) these same KRAB-ZFP clusters have very low effect on the mobility of some young LTR retrotransposons of the ETn/MusD class, although genetic background may play a role. Interbreeding between KO/KO animals over several generations may enhance mobilization and phenotypic outcomes, but this was not addressed here.</p><p>This is the first study of this scale to investigate KRAB-ZFP targets and function in mice. The approaches are state-of-the art, involving large, megabase-scale deletions, development of capture-sequencing for mapping de novo insertions of ETn/MusD copies, and analysis of the effects of KRAB-ZFP on retrotransposon mobilization at the organism level. The manuscript is clear and conclusions are fair. However, the search of new insertions in mutant mouse pedigrees is not fully conclusive, since the contribution of the genetic background on ETn/MusD mobilization seems to be higher than the effects of cluster deletions &#x02013; if any.</p><p>Essential revisions:</p><p>1) Related to the identification of new insertions: the absence of basic reporting on the efficiency of the capture sequencing approach renders difficult to evaluate if the numbers provided are realistic or not. Recovery rate should be assessed using reference or parental retrotransposons. Also, a table with all insertions (genomic coordinates) and individuals could not be found and should be provided. PCR-validating a significant number of putative somatic insertions would be useful to estimate the rate of false positives, since these insertions have low-read counts (only germline insertions were tested in Figure 4&#x02014;figure supplement 3A).</p><p>2) It is not very clear as to whether the retrotransposition rate is also low in ES cells. Could the authors be more clear about this? On this matter, the authors should test the rate of retrotransposition using the plasmid-borne ETn/MusD retrotransposition assay in the context of cl4 KO/KO ES cells (as used in Figure Sup3C). Similarly, they could use available LINE-1 and/or IAP retrotransposition cassettes in cl2 KO/KO ES cells, where these elements are the most reactivated (according to the RNA-seq data).</p><p>3) As most of the manuscript is heavily focused on ETn elements, the authors should provide more information on this family. Notably, the authors should make clear from the beginning that ETnERV = MusD, explain what MusD, ETnI and ETnII are -relative to each others- and give indications about the number of these different ETn-related elements in the mouse reference genome (C57bl6/J) genome and also in 129Sv.</p><p>4) Figure 2C: Was CpG methylation calculated over the entire sequence of full-length copies only? Or are the methylation values extracted from all ETn- or IAP-related reads ? This should be clarified. Also, the authors should repeat their analysis by specifically focusing on CpG methylation located in the LTR sequence of these elements. This is where DNA methylation has been shown to be targeted and to matter for retrotransposons repression. Changes may be greater.</p><p>5) The data graphed in Figure 4B do not seem to reflect the pedigree depicted in Figure 4A. Only 24 progenies could be counted for the KO/WT x KO/WT matings (12 WT/WT or WT/KO, and 12 KO/KO). In addition, some data points (such as the second from top, x=KO, y=~65) seem to originate from a different type of cross (heterozygote x homozygote). The overall conclusion (that KO produce on average more ETn insertions) could be quite different if a small number of individuals was misplaced in the graph, given the size of the samples. It is important to make sure which animals were accounted from the pedigree in 4A to produce the boxplot 4B.</p></body></sub-article><sub-article id="sa2" article-type="reply"><front-stub><article-id pub-id-type="doi">10.7554/eLife.56337.sa2</article-id><title-group><article-title>Author response</article-title></title-group></front-stub><body><disp-quote content-type="editor-comment"><p>Essential revisions:</p><p>1) Related to the identification of new insertions: the absence of basic reporting on the efficiency of the capture sequencing approach renders difficult to evaluate if the numbers provided are realistic or not. Recovery rate should be assessed using reference or parental retrotransposons. Also, a table with all insertions (genomic coordinates) and individuals could not be found and should be provided. PCR-validating a significant number of putative somatic insertions would be useful to estimate the rate of false positives, since these insertions have low-read counts (only germline insertions were tested in Figure 4&#x02014;figure supplement 3A).</p></disp-quote><p>We compiled a list of largely intact MMETN insertions (two RLTRETN_Mm LTRs flanking an internal MMETn-int region) in the mm9 reference genome and we identified &#x0003e;95% of these 309 insertions in the majority of samples. The average read number supporting these insertions was approximately two-fold higher than for the novel germ-line insertions that we identified. This difference is most likely a result of the heterozygosity of the novel insertions. We added this data to Figure 4&#x02014;figure supplement 3B and refer to it in the main text.</p><p>As for the PCR validation, the five tested and confirmed insertions were actually putative somatic insertions (not germ-line insertions) with low read counts (Figure 4&#x02014;figure supplement 3A). We only tested 5 somatic insertions for validation by PCR, and all 5 were confirmed. This suggests that our capture -seq pipeline has a low false positive rate.</p><p>We compiled a list of all novel insertions with supporting read numbers and added it as Figure 4&#x02014;source data 1.</p><disp-quote content-type="editor-comment"><p>2) It is not very clear as to whether the retrotransposition rate is also low in ES cells. Could the authors be more clear about this? On this matter, the authors should test the rate of retrotransposition using the plasmid-borne ETn/MusD retrotransposition assay in the context of cl4 KO/KO ES cells (as used in Figure Sup3C). Similarly, they could use available LINE-1 and/or IAP retrotransposition cassettes in cl2 KO/KO ES cells, where these elements are the most reactivated (according to the RNA-seq data).</p></disp-quote><p>We screened the two KO/WT ES cell lines that had been used to derive the Chr4-cl KO mouse line by Capture-seq to exclude that newly identified insertions had been present in the founder ES cells already. We could not identify any new insertions in these cell lines either; however, we did not screen KO/KO ES cells so we cannot speculate about retrotransposition rate of endogenous ETn elements in WT and KO ES cells.</p><p>We attempted repeatedly to perform our retrotransposition assays in Chr4-cl WT and KO ES cells using ETn and MusD reporters, yet were not able to observe resistant colonies that would indicate retrotransposition events in any of the ES cell clones (WT or KO). Our explanation is that KRAB-ZFP independent mechanisms restrict retrotransposition in ES cells. The transfection efficiency is also lower in ES cells which could contribute to reduced rates relative to other cultured cell lines.</p><p>As for Chr2-cl KO ES cells, such experiments are, together with Capture-seq screens of Chr2-cl KO mice on the way but are planned to be included in a follow-up study that focuses on IAP and L1 retrotransposition.</p><disp-quote content-type="editor-comment"><p>3) As most of the manuscript is heavily focused on ETn elements, the authors should provide more information on this family. Notably, the authors should make clear from the beginning that ETnERV = MusD, explain what MusD, ETnI and ETnII are -relative to each others- and give indications about the number of these different ETn-related elements in the mouse reference genome (C57bl6/J) genome and also in 129Sv.</p></disp-quote><p>We have added the following to the manuscript:</p><p>&#x0201c;including the closely related MMETn (hereafter referred to as ETn) and ETnERV (also known as MusD) elements (Figure 1A and Figure 1&#x02014;source data 2). ETn elements are non-autonomous LTR retrotransposons that require trans-complementation by the fully coding ETnERV elements that contain Gag Pro and Pol genes Ribet et al., 2004().These elements have accumulated to ~240 and ~100 copies in the reference C57BL/6 genome, respectively, with ~550 solitary LTRs Baust et al., 2003(). Both ETn and ETnERVs are still active, generating polymorphisms and mutations in several mouse strains Gagnier et al., 2019().&#x0201d;</p><disp-quote content-type="editor-comment"><p>4) Figure 2C: Was CpG methylation calculated over the entire sequence of full-length copies only? Or are the methylation values extracted from all ETn- or IAP-related reads ? This should be clarified. Also, the authors should repeat their analysis by specifically focusing on CpG methylation located in the LTR sequence of these elements. This is where DNA methylation has been shown to be targeted and to matter for retrotransposons repression. Changes may be greater.</p></disp-quote><p>We determined CpG methylation separately on LTRs and internal regions. However, the loss of CpG methylation at ETn LTRs was actually less pronounced than in the internal regions. We believe that the KRAB-ZFP binding prevents demethylation at the immediate binding sites and all ETn-targeting KRAB-ZFPs we identified bind the internal ETn region and not the LTRs.</p><p>As for IAP, we saw a substantial decrease (20% of CpG sites) in methylation in one type of IAP LTR (IAPLTR4) which is directly targeted by a Chr4-cl KRAB-ZFP (Gm13157). However, the other targeted IAP LTRs (e.g. by Gm21082) did not display any methylation loss family-wide, most likely due to redundancy of KRAB-ZFPs that bind IAP elements. Preliminary data we have collected indicates however that a small subset of IAP LTR-subtypes that are bound by Gm21082, lose methylation in Chr4 KOs.</p><disp-quote content-type="editor-comment"><p>5) The data graphed in Figure 4B do not seem to reflect the pedigree depicted in Figure 4A. Only 24 progenies could be counted for the KO/WT x KO/WT matings (12 WT/WT or WT/KO, and 12 KO/KO). In addition, some data points (such as the second from top, x=KO, y=~65) seem to originate from a different type of cross (heterozygote x homozygote). The overall conclusion (that KO produce on average more ETn insertions) could be quite different if a small number of individuals was misplaced in the graph, given the size of the samples. It is important to make sure which animals were accounted from the pedigree in 4A to produce the boxplot 4B.</p></disp-quote><p>The data in Figure 4B indeed comes from mice derived from three different mating (two KO/WT x KO/WT and one KO x KO/WT in the B6/129 F2 background). Not all animals from these matings were included in the analysis, only the ones where the DNA was purified from juvenile tail tissue. We did that to exclude a potential bias since tail samples showed generally more novel insertions than other tissues (Figure 4&#x02014;figure supplement 3C).</p><p>We now show the data separated by matings as well as combined, and describe the data in the main text. This analysis revealed that in only one of the three matings, the difference in ETn insertions between KO/WT and KO reaches significance (p value &#x0003c; 0.05). We re-wrote the paragraph in the main text accordingly.</p></body></sub-article></article>