eISSN: 2221-6197 DOI: 10.31301/2221-6197

Bioinformatic resources for in silico search of the CRISPR loci in the genomes of prokaryotes

Year: 2017

Pages: 229-244

Number: Volume 9, issue 3

Type: scientific article

Summary:

Brief characteristics are given of CRISPR loci, which are found in approximately half of bacteria and in most archaea. Their typical organization is shown; an important element of it is the CRISPR-cassette, which contains unique spacers alternating with identical direct repeats. Specialized programs that search for CRISPR-cassettes in sequenced microbial genomes and in metagenomic data by identifying repeated regions are briefly considered. The web pages of these programs, together with their purpose and capabilities, are presented in tabular form. Databases of CRISPR loci are described, with their web addresses. Almost all of the available literature on the subject and the relevant Internet resources are analyzed.
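
The repeat-spacer organization described above can be illustrated with a minimal Python sketch. It is not the algorithm of any program covered by the article: it only looks for exact copies of a fixed-length repeat separated by short gaps, and the function names, parameters, and toy sequence are hypothetical. Real detection tools additionally tolerate mismatches within repeats and work with repeat and spacer lengths of a few tens of base pairs.

```python
from collections import defaultdict


def find_crispr_like_arrays(seq, repeat_len, min_spacer, max_spacer, min_repeats=3):
    """Return (repeat, start_positions) for runs of an exact repeat
    whose consecutive copies are separated by min_spacer..max_spacer bases."""
    # Index every substring of length repeat_len by its start positions.
    positions = defaultdict(list)
    for i in range(len(seq) - repeat_len + 1):
        positions[seq[i:i + repeat_len]].append(i)

    arrays = []
    for kmer, starts in positions.items():
        run = [starts[0]]
        for pos in starts[1:]:
            gap = pos - run[-1] - repeat_len  # spacer length between two copies
            if min_spacer <= gap <= max_spacer:
                run.append(pos)
            else:
                # The regular spacing is broken: keep the run if long enough.
                if len(run) >= min_repeats:
                    arrays.append((kmer, run))
                run = [pos]
        if len(run) >= min_repeats:
            arrays.append((kmer, run))
    return arrays


def spacers_of(seq, starts, repeat_len):
    """Extract the sequences lying between consecutive repeat copies."""
    return [seq[a + repeat_len:b] for a, b in zip(starts, starts[1:])]


# Toy sequence (hypothetical): flank, then repeat-spacer-repeat-spacer-repeat, then flank.
toy = "ATATAT" + "GTTCCAGC" + "AAACGT" + "GTTCCAGC" + "TGGATC" + "GTTCCAGC" + "CACACA"
for repeat, starts in find_crispr_like_arrays(toy, repeat_len=8, min_spacer=4, max_spacer=12):
    print(repeat, starts, spacers_of(toy, starts, len(repeat)))
```

Because the repeat length is fixed in this sketch, a repeat longer than `repeat_len` would be reported several times as overlapping fragments; merging such hits is left out for brevity.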

Keywords:

CRISPR, CRISPR/Cas system, CRISPR-locus, CRISPR-cassette, spacer, quasi-tandem repeat, protospacer, software, web-resource, database
