{"accession": "GCA_023511455.1", "species": "Bacillota bacterium", "seqid": "JAIMBJ010000015.1", "features": [{"seqid": "JAIMBJ010000015.1", "type": "gene", "strand": "+", "end": 63695, "attributes": {"ID": "gene-K6U75_09910", "Name": "K6U75_09910", "gene_biotype": "protein_coding", "gbkey": "Gene", "locus_tag": "K6U75_09910"}, "score": ".", "phase": ".", "start": 62592, "source": "Genbank"}, {"type": "CDS", "score": ".", "source": "GeneMarkS-2+", "seqid": "JAIMBJ010000015.1", "strand": "+", "start": 62592, "phase": "0", "end": 63695, "attributes": {"protein_id": "MCL6475351.1", "ID": "cds-MCL6475351.1", "product": "PEP-CTERM sorting domain-containing protein", "inference": "COORDINATES: ab initio prediction:GeneMarkS-2+", "Name": "MCL6475351.1", "Dbxref": "NCBI_GP:MCL6475351.1", "transl_table": "11", "Parent": "gene-K6U75_09910", "Note": "PEP-CTERM proteins occur%2C often in large numbers%2C in the proteomes of bacteria that also encode an exosortase%2C a predicted intramembrane cysteine proteinase. The presence of a PEP-CTERM domain at a protein's C-terminus predicts cleavage within the sorting domain%2C followed by covalent anchoring to some component of the (usually Gram-negative) cell surface. Many PEP-CTERM proteins exhibit an unusual sequence composition that includes large numbers of potential glycosylation sites. 
Expression of one such protein has been shown to restore the ability of a bacterium to form floc%2C a type of biofilm.", "gbkey": "CDS", "locus_tag": "K6U75_09910"}}, {"strand": "+", "attributes": {"gene": "purD", "Parent": "gene-K6U75_09905", "locus_tag": "K6U75_09905", "Dbxref": "NCBI_GP:MCL6475350.1", "product": "phosphoribosylamine--glycine ligase", "ID": "cds-MCL6475350.1", "Name": "MCL6475350.1", "transl_table": "11", "protein_id": "MCL6475350.1", "inference": "COORDINATES: similar to AA sequence:RefSeq:WP_013537695.1", "gbkey": "CDS"}, "source": "Protein Homology", "seqid": "JAIMBJ010000015.1", "type": "CDS", "score": ".", "end": 62558, "start": 61281, "phase": "0"}, {"source": "Genbank", "type": "gene", "strand": "+", "phase": ".", "end": 62558, "start": 61281, "score": ".", "seqid": "JAIMBJ010000015.1", "attributes": {"locus_tag": "K6U75_09905", "gene": "purD", "gbkey": "Gene", "Name": "purD", "ID": "gene-K6U75_09905", "gene_biotype": "protein_coding"}}, {"source": "Genbank", "score": ".", "start": 57956, "attributes": {"ID": "gene-K6U75_09890", "gbkey": "Gene", "Name": "K6U75_09890", "locus_tag": "K6U75_09890", "gene_biotype": "protein_coding"}, "phase": ".", "strand": "+", "type": "gene", "end": 59329, "seqid": "JAIMBJ010000015.1"}, {"start": 57956, "source": "Protein Homology", "score": ".", "attributes": {"locus_tag": "K6U75_09890", "transl_table": "11", "Parent": "gene-K6U75_09890", "protein_id": "MCL6475347.1", "product": "MlaD family protein", "Name": "MCL6475347.1", "ID": "cds-MCL6475347.1", "Dbxref": "NCBI_GP:MCL6475347.1", "gbkey": "CDS", "inference": "COORDINATES: protein motif:HMM:NF014523.2"}, "type": "CDS", "seqid": "JAIMBJ010000015.1", "strand": "+", "phase": "0", "end": 59329}, {"attributes": {"gene": "rplI", "gbkey": "Gene", "Name": "rplI", "gene_biotype": "protein_coding", "locus_tag": "K6U75_09895", "ID": "gene-K6U75_09895"}, "end": 59821, "source": "Genbank", "start": 59360, "type": "gene", "score": ".", "seqid": "JAIMBJ010000015.1", 
"phase": ".", "strand": "+"}, {"type": "gene", "start": 59880, "attributes": {"ID": "gene-K6U75_09900", "Name": "dnaB", "locus_tag": "K6U75_09900", "gene": "dnaB", "gene_biotype": "protein_coding", "gbkey": "Gene"}, "phase": ".", "strand": "+", "score": ".", "end": 61277, "source": "Genbank", "seqid": "JAIMBJ010000015.1"}, {"seqid": "JAIMBJ010000015.1", "score": ".", "type": "CDS", "start": 59880, "end": 61277, "attributes": {"Dbxref": "NCBI_GP:MCL6475349.1", "gene": "dnaB", "Parent": "gene-K6U75_09900", "protein_id": "MCL6475349.1", "transl_table": "11", "Name": "MCL6475349.1", "gbkey": "CDS", "inference": "COORDINATES: similar to AA sequence:RefSeq:WP_013121789.1", "locus_tag": "K6U75_09900", "ID": "cds-MCL6475349.1", "product": "replicative DNA helicase"}, "strand": "+", "phase": "0", "source": "Protein Homology"}, {"source": "Protein Homology", "score": ".", "type": "CDS", "end": 59821, "seqid": "JAIMBJ010000015.1", "attributes": {"protein_id": "MCL6475348.1", "transl_table": "11", "Parent": "gene-K6U75_09895", "inference": "COORDINATES: similar to AA sequence:RefSeq:WP_013824525.1", "locus_tag": "K6U75_09895", "ID": "cds-MCL6475348.1", "Name": "MCL6475348.1", "Dbxref": "NCBI_GP:MCL6475348.1", "gene": "rplI", "gbkey": "CDS", "product": "50S ribosomal protein L9"}, "strand": "+", "phase": "0", "start": 59360}], "is_reverse_complement": false, "end": 63027, "taxonomy": "d__Bacteria;p__Armatimonadota;c__HRBIN16;o__HRBIN16;f__HRBIN16;g__HRBIN16;s__HRBIN16 sp023511455", "start": 58423, "sequence": 
"CAACGCGCTGATTGCCGACCCGGAGATGCAGGCGGGCATCCGGCAAACGCTGAACAACGTGGAGCAGGCATCAGCCCGCCTCACCGTGCTGATAGAACAAGCCCGAGGTCTGCTGGCGAGCACGGGGCGTTCTCTGGATGCTGTGGTGCAGGACACCCGGCGTTCCACCAGAATATTGCCGGAAGTCATGGCGGACCTGAAGGCTTTGCTGGAGCAATCCCGCGAGCTGGTGCGCACCGCCCAGCACGCTACTGAGAGCTTCGACCGCTTTGTGAGCGATGAGCAGCTCCAGCGGAACTTGCGCGAGGTCGCCGCAGAGATGAACGCATTGAGTGCGAAACTGAATCGTATCGCCGAGGACATCGGCAAATACACGGGCGACGAAGAGCTCAGGGAGAACGTGCGCGGCACTGTCGCCGAAGCGCGGGCTACCCTTACGGAGGCACGTCAGGCAACGGAAAGGATTAATCGCTTTGTGGAGCGACTGATTCAACCCGGCAAGCTGTCCATCAGACCGACAGAGGTCTCTCTGGATGTGTACGGCTTGCTGCGTGACGGTAATTTTCGCACTGACCTGACGCTCTCGTTCCCGTATCGCGATAATCGCTTCTTTTATCTTGGCGTGTACGATGTTACCGAGAGCAACAAGTTCATCCTGCAGTATGGTTCACAGTTAGCGCCAACGCTGGATTTACGCTATGGACTCTACGCCTCCAAACCGGGGGTAGGGGTGGACTGGAGGTTCAAGCCCGGACTTCATCTGCGGGCGGATGCGTTTGACCCGAATGACCTGCAGATAAACACACGGGCGAAGATACAACTGAGCACAGACTGGAGTCTGTGGGTGGGTATCGACAGCCTGTTCGACCAGAACCAGCCGGTGTTGGGGGTGCAGTTGACACGTTAGGTTGATGGGAGGAGGCAAAAAACGATGGCGATGAAAGTGATACTCACGCAGGACGTGCCCTCGCTGGGCAAGCATGGCGAGGTGGTCAACGTCTCGGAGGGGTACGCACGGAACTATCTGTTCCCGCGTGGGCTGGCGATTGCTGCCGATAAGGGCGCGATGAAAAACGTTCAGCTTCGCCAGAAGCAGGAGGCGATGCGAGCAGAAAAAGCCGCGCAAGAAGCACGCCAGATTGCGGATGTGCTGAGAGGCAGGACAGTCACGGTGAAGGCACATGCCGGTAAGGGAACTACCAAACTGTTCGGCGCGGTGACTGCCCAACACATTGCAGATTCCATCGCACAGCAATACCATGTGAAGGTGGACAAGCGTAAAATCGGTTTGCTGGAGCCGATTAAGTCGCTGGGAGAGTATGAGGTTACACTGCACCTCCATCACGATGTGAACCTGACGTTGAAGGTGGAGGTCGTACCTCAGGAGGCTTCTGCATAAAGTTGGGTGTTGTGCGATGAGAAGGAATTCAGACAGGCAGGGCAAGCAAGAGTCACGCATGAGTGAGCTGCTGCGTGCCCTGCCGCACAATCTGGATGCGGAGATTGCCACGCTGGGTGCGATGATGATAGACGGCGCTGCCATCGAGCGCGTGAGCGAGTTCCTTGCGCCCGACGATTTCTATCGGGAGACGCACCGCGTGATTTATGATGCCCTGATAGCCCTTTCCGGCCGCAACGAGCCGGTGGACCTCGTCACCCTCAGTGAGGAACTGCAACGGCGCGGGAAGCTGGAAGAGGTCGGAGGTATCGCTTACCTGACCACTCTCATGGACAGCGTGCCCAGCGCGGCGAATGTGGACTACTATGCCCAGATTGTGGAAGAATATGCTATCCGCCGACGGCTTATCGAGGCGGCTCAGGAGATTATCCACATGGCAGCGGTGACCGGGGATGAGGAAGAACCGCTGTCTATTGGCGAAATAGCCGACCGGGCAGAGAGCGCGGTTTATCGGGTAGCGCAGCGGCGTATCGGCAAAGGCTTTGAAAGCATCCGCCCTTTGCTGGGTGAGGCGTTTGACCGCTTCGAAGCCTTTTATCA
TGAGCGCAAGCTGGTAACAGGTTTAAGCACGGGCTTTCGCGAACTGGACTTTGTCACTGCGGGGCTGCAGAAGTCGGACCTGGTGATTATTGCGGCACGCCCCAGCATGGGCAAAACCAGCTTGTGCCTGAACATCGGTGAGCACGTCGCCCTTCGTGAAGGCAAAACGGTGGCTATCTTCAGCCTGGAGATGTCCAAAGAGCAGCTCGTGCAGCGCATGATTTGCTCGCAGGCGGAGGTGAACGCGCATCGTTTGCGGTTGGGAATGCTGCCCGACACCGCATGGCAGAGGCTTGCCAAAGCGGTGCAGGAACTCTGGAATGCCAAGATATTCATCGATGACACGCCGGACATCTCGGTGCTGGAGATGCGGGCGAAGTGTCGCCGCTTGCGTGCCGAACACGGGTTGGACCTGGTGATTGTGGACTACCTGCAGCTGATGCGTAGCCACCGGCGAGCGGAGAACCGCACGCAGGAGATTTCGGACATCGCCCGCGCGCTGAAGGGGCTGGCGCGCGAGCTGGAAGTGCCCATTATTGCCCTGTCGCAGTTATCCCGTGCCGTAGAGCATCGCGAGAATAAGCGTCCAATGCTATCGGATTTGCGCGAGAGCGGTTCTATCGAAGCAGAAGCCGATGTGGTGGCATTCATCTACCGCCCCGAGTATTACGCGATGAAGGAGGCGGTCTCTACGGATGACGTGGAGGCAGGGACGATGCCTCGCGAGGAGGGGCGGGTCGAGGAAGCCGAGATTATCATTGCCAAACAACGAAACGGTCCAACGGGTACCGTGCGAGTCGGCTTCAAGCCCGATTATGCGCGGTTTGTCCCGCTGGAGCGACATCGGGAGGAATGAGCCTTGAAGGTACTGGTGATAGGTGGGGGTGGGCGCGAGCACGCACTGGTGTGGAAAATCGCGCAGAGCCCGAAAGTAGCCAAAATCTATGCGGCGCCGGGTAACGCAGGCATTGCTGAACTGGCGGAGTGCGTACCAATTAAAGCCACCGACATCGAAAGTCTGGCAAGCTTCGCCGAACGCAACCGTATTGACCTGACGGTTGTCGGACCCGAGTCGCCCTTGATTGCGGGTGTCGTGGATGTATTTGAGACGCGCGGGCTGGCGATATTTGGTCCCAGCAAAGAGCCAGCGCGTCTGGAAGGCAGTAAAGTGTACGCGAAGGAAGTCATGCGCCGTTATCGTATTCCCACCGCTGATTTCTCGGTTTTCAGTGAACCGCATTCGGCTGCGGAATACGCGCACCGTCGTTTTCAGGAAGGTGCGAAAGGACTGGTGATTAAAGCGGATGGCGAGGCGGGGGGTAAAGGCGTTTTCGTCGTGCATCATCTGGACGAGGCGCTGGAAGTGATACACTCGCTCATGGAGGAACGTGTGCTCGGAGAGGCAGGCGCGCGTGTGGTGATTGAAGAGGTTTTAGTTGGGGAAGAGGCTTCGCTGATGGCGTTTACCGACGGAGCCACTGTACTGCCTATGCTGCCGGTTCAGGATTATAAGCGCGCTCTCGACCACGACCGCGGAGCGAACACCGGTGGGATGGGCAGTATCTGTCCCCTGCGCCTTGTTACGCCGGAACTGCATCAGCAAGCGATAGAGCAGATTATCCATCCTGCCATCCGCGCCACGCGCGACGCCGGTATCCCCTTCCGCGGTGTGCTGTACGCAGGCACGATGGTTACCGAAGAGGGCATTAAGACACTGGAGTTCAATGTGCGATTCGGTGACCCTGAAACGCAGGCAGTCCTGCCCTTGCTGGAGAACGATATAGTGGAGGTGATGCAGGCGGCGGTCGAGTGCCGTCTGGATGAGGTTACCTTGCAGTGGAAACCGCGCTATTCGGTCTGTGTGGTGGTAGCATCGGGAGGCTACCCGGGCAAATATGAGACGGGTTTGCCGATCGAGGGGCTGGATGAGGCGGCGCAGGTGCCGGAATGCGTGGTCTTCCATGCAGGCACGCGCAGAGATGGCGATCAGGTGGTAACCG
CTGGCGGGCGCGTGCTGGGAGTAACCGCCCTGGGCGATACGCTTGCTCAGGCGCGTGGGCGAGCCTACGAGGCGGTACGGTGTATCCGCTTCGAGTATATGCATTACCGCACAGACATCGGCATGAAGTGGGTATGAAAACTCGGTAAAAAAACGAAAGGAGGGATATGGATGCTAACCTGGCGGTATGTGGTGGCGGGCGGTTTGCTGCTGCTGATGTCTCTGGCGTGCCTGCAGGCGCATCCGCAGCCACCGACTGTGTTTCGCACCCAAGTAATATATCGGCACACGAACTCGGTACAGTCGGTACACATCAACAACAACGGCTGGGTTGTCTGGTCGGAGGGGGACTTCACCCGCAGTGATGTGTGGCTGTACGATGGCTCAGCAGTCCGTCGGCTGAGTGCAGGTGAACAGCGGCGTCTGAACACCTCTCCTCGTCTCAACAACCGGAACCAGATTGTGTGGCGTTACGATGATGGGGAAGTGTCTGACGTGGTACTGTGGGATAGCGGGATACTCACGAACATCACCCGCAGCGACGGCTCCGTGGGCTTTGGCGCACCGGATATCAATGACCTGGGCTGGGTGGTCACTACGGGAACGG", "length": 4605}