Genome annotation with prokka

Now that we have some extra information about our bins, we can continue to analyse the high-quality bins. The final CheckM results will give you a good overview of the bins with low contamination and high completeness and show the bin's lowest taxonomic rank. Pick a bin that you think is interesting to study further. Alternatively, you may also make a loop to annotate multiple bins.

With this selected bin(s), we are going to do genome annotation. Whole-genome annotation is the process of identifying features of interest in a set of genomic DNA sequences and labelling them with useful information. Prokka is a software tool to annotate bacterial, archaeal and viral genomes quickly and produce standards-compliant output files.

[DO:] make a directory for the prokka output:

In [ ]:
mkdir data/annotation

[DO:] read the prokka help page Remember to look for the usage line.

In [5]:
prokka -h
Option h is ambiguous (help, hmms)
Name:
  Prokka 1.13 by Torsten Seemann <torsten.seemann@gmail.com>
Synopsis:
  rapid bacterial genome annotation
Usage:
  prokka [options] <contigs.fasta>
General:
  --help            This help
  --version         Print version and exit
  --docs            Show full manual/documentation
  --citation        Print citation for referencing Prokka
  --quiet           No screen output (default OFF)
  --debug           Debug mode: keep all temporary files (default OFF)
Setup:
  --listdb          List all configured databases
  --setupdb         Index all installed databases
  --cleandb         Remove all database indices
  --depends         List all software dependencies
Outputs:
  --outdir [X]      Output folder [auto] (default '')
  --force           Force overwriting existing output folder (default OFF)
  --prefix [X]      Filename output prefix [auto] (default '')
  --addgenes        Add 'gene' features for each 'CDS' feature (default OFF)
  --addmrna         Add 'mRNA' features for each 'CDS' feature (default OFF)
  --locustag [X]    Locus tag prefix [auto] (default '')
  --increment [N]   Locus tag counter increment (default '1')
  --gffver [N]      GFF version (default '3')
  --compliant       Force Genbank/ENA/DDJB compliance: --addgenes --mincontiglen 200 --centre XXX (default OFF)
  --centre [X]      Sequencing centre ID. (default '')
  --accver [N]      Version to put in Genbank file (default '1')
Organism details:
  --genus [X]       Genus name (default 'Genus')
  --species [X]     Species name (default 'species')
  --strain [X]      Strain name (default 'strain')
  --plasmid [X]     Plasmid name or identifier (default '')
Annotations:
  --kingdom [X]     Annotation mode: Archaea|Bacteria|Mitochondria|Viruses (default 'Bacteria')
  --gcode [N]       Genetic code / Translation table (set if --kingdom is set) (default '0')
  --gram [X]        Gram: -/neg +/pos (default '')
  --usegenus        Use genus-specific BLAST databases (needs --genus) (default OFF)
  --proteins [X]    FASTA or GBK file to use as 1st priority (default '')
  --hmms [X]        Trusted HMM to first annotate from (default '')
  --metagenome      Improve gene predictions for highly fragmented genomes (default OFF)
  --rawproduct      Do not clean up /product annotation (default OFF)
  --cdsrnaolap      Allow [tr]RNA to overlap CDS (default OFF)
Computation:
  --cpus [N]        Number of CPUs to use [0=all] (default '8')
  --fast            Fast mode - only use basic BLASTP databases (default OFF)
  --noanno          For CDS just set /product="unannotated protein" (default OFF)
  --mincontiglen [N] Minimum contig size [NCBI needs 200] (default '1')
  --evalue [n.n]    Similarity e-value cut-off (default '1e-06')
  --rfam            Enable searching for ncRNAs with Infernal+Rfam (SLOW!) (default '0')
  --norrna          Don't run rRNA search (default OFF)
  --notrna          Don't run tRNA search (default OFF)
  --rnammer         Prefer RNAmmer over Barrnap for rRNA prediction (default OFF)
(metagenomics_practical) 

Use the options --centre X --compliant to stop prokkas complaints about ugly contig names.

[DO:] run prokka: This may take a while.

In [6]:
prokka data/bins/bin.1.fa --outdir data/annotation/bin1  --cpus 1 --centre X --compliant
[16:34:40] This is prokka 1.13
[16:34:40] Written by Torsten Seemann <torsten.seemann@gmail.com>
[16:34:40] Homepage is https://github.com/tseemann/prokka
[16:34:40] Local time is Tue Mar 22 16:34:40 2022
[16:34:40] You are laura
[16:34:40] Operating system is linux
[16:34:40] You have BioPerl 1.007002
[16:34:40] System has 4 cores.
[16:34:40] Will use maximum of 1 cores.
[16:34:40] Annotating as >>> Bacteria <<<
[16:34:40] Generating locus_tag from 'data/bins/bin.1.fa' contents.
[16:34:40] Setting --locustag HFHIMAGJ from MD5 1f126a0339e636dd81db632fa03976c5
[16:34:40] Enabling options to ensure Genbank/ENA/DDJB submission compliance.
[16:34:40] Creating new output folder: data/annotation/bin1
[16:34:40] Running: mkdir -p data\/annotation\/bin1
[16:34:40] Using filename prefix: PROKKA_03222022.XXX
[16:34:40] Setting HMMER_NCPU=1
[16:34:40] Writing log to: data/annotation/bin1/PROKKA_03222022.log
[16:34:40] Command: /opt/miniconda3/envs/metagenomics_practical/bin/prokka data/bins/bin.1.fa --outdir data/annotation/bin1 --cpus 1 --centre X --compliant
[16:34:40] Appending to PATH: /opt/miniconda3/envs/metagenomics_practical/bin
[16:34:40] Looking for 'aragorn' - found /opt/miniconda3/envs/metagenomics_practical/bin/aragorn
[16:34:40] Determined aragorn version is 1.2
[16:34:40] Looking for 'barrnap' - found /opt/miniconda3/envs/metagenomics_practical/bin/barrnap
[16:34:40] Determined barrnap version is 0.9
[16:34:40] Looking for 'blastp' - found /opt/miniconda3/envs/metagenomics_practical/bin/blastp
[16:34:40] Determined blastp version is 2.9
[16:34:40] Looking for 'cmpress' - found /opt/miniconda3/envs/metagenomics_practical/bin/cmpress
[16:34:40] Determined cmpress version is 1.1
[16:34:40] Looking for 'cmscan' - found /opt/miniconda3/envs/metagenomics_practical/bin/cmscan
[16:34:40] Determined cmscan version is 1.1
[16:34:40] Looking for 'egrep' - found /bin/egrep
[16:34:40] Looking for 'find' - found /usr/bin/find
[16:34:40] Looking for 'grep' - found /bin/grep
[16:34:40] Looking for 'hmmpress' - found /opt/miniconda3/envs/metagenomics_practical/bin/hmmpress
[16:34:40] Determined hmmpress version is 3.3
[16:34:40] Looking for 'hmmscan' - found /opt/miniconda3/envs/metagenomics_practical/bin/hmmscan
[16:34:40] Determined hmmscan version is 3.3
[16:34:40] Looking for 'java' - found /opt/miniconda3/envs/metagenomics_practical/bin/java
[16:34:40] Looking for 'less' - found /usr/bin/less
[16:34:40] Looking for 'makeblastdb' - found /opt/miniconda3/envs/metagenomics_practical/bin/makeblastdb
[16:34:40] Determined makeblastdb version is 2.9
[16:34:40] Looking for 'minced' - found /opt/miniconda3/envs/metagenomics_practical/bin/minced
[16:34:41] Determined minced version is 4.2
[16:34:41] Looking for 'parallel' - found /opt/miniconda3/envs/metagenomics_practical/bin/parallel
[16:34:41] Determined parallel version is 20160622
[16:34:41] Looking for 'prodigal' - found /opt/miniconda3/envs/metagenomics_practical/bin/prodigal
[16:34:41] Determined prodigal version is 2.6
[16:34:41] Looking for 'prokka-genbank_to_fasta_db' - found /opt/miniconda3/envs/metagenomics_practical/bin/prokka-genbank_to_fasta_db
[16:34:41] Looking for 'sed' - found /bin/sed
[16:34:41] Looking for 'tbl2asn' - found /opt/miniconda3/envs/metagenomics_practical/bin/tbl2asn
[16:34:41] Determined tbl2asn version is 25.8
[16:34:41] Using genetic code table 11.
[16:34:41] Loading and checking input file: data/bins/bin.1.fa
[16:34:41] Wrote 37 contigs totalling 4425020 bp.
[16:34:41] Predicting tRNAs and tmRNAs
[16:34:41] Running: aragorn -l -gc11  -w data\/annotation\/bin1\/PROKKA_03222022\.fna
[16:34:44] 1 tRNA-Ser [31831,31923] 35 (gct)
[16:34:44] 2 tRNA-Leu [59288,59374] 35 (tag)
[16:34:44] 3 tRNA-Gly [65460,65535] 34 (gcc)
[16:34:44] 4 tRNA-Cys [65630,65703] 33 (gca)
[16:34:44] 5 tRNA-Leu [65763,65854] 35 (taa)
[16:34:44] 6 tRNA-Val [281855,281929] 33 (gac)
[16:34:44] 7 tRNA-Leu c[294270,294354] 35 (cag)
[16:34:44] 8 tRNA-Trp c[379468,379558] 31 (cca)
[16:34:44] 9 tRNA-Asn c[434319,434394] 34 (gtt)
[16:34:44] 10 tRNA-Pro [523818,523894] 35 (tgg)
[16:34:44] 11 tRNA-Arg [568516,568592] 35 (tct)
[16:34:44] 12 tRNA-Lys [677102,677177] 34 (ttt)
[16:34:44] 1 tRNA-Leu c[30915,30999] 35 (gag)
[16:34:44] 2 tRNA-Phe [164483,164576] 35 (gaa)
[16:34:44] 3 tRNA-Ala [191584,191659] 34 (cgc)
[16:34:44] 4 tRNA-Lys [191686,191761] 34 (ctt)
[16:34:44] 5 tRNA-Pro [429715,429791] 35 (cgg)
[16:34:44] 1 tRNA-Phe [13659,13749] 33 (gaa)
[16:34:44] 2 tRNA-Pro [132617,132709] 33 (ggg)
[16:34:44] 3 tRNA-Pro c[138934,139010] 35 (ggg)
[16:34:44] 4 tRNA-His c[276316,276391] 34 (gtg)
[16:34:44] 5 tRNA-Val c[278421,278496] 34 (tac)
[16:34:44] tRNA [23282,23373] is a pseudo/wacky gene - skipping.
[16:34:44] 2 tRNA-Leu [88100,88184] 35 (caa)
[16:34:44] 1 tRNA-Arg c[32552,32627] 34 (cct)
[16:34:44] 2 tRNA-Met [68152,68228] 35 (cat)
[16:34:44] 1 tRNA-Thr c[173609,173684] 34 (tgt)
[16:34:44] 1 tRNA-Glu [77,152] 35 (ttc)
[16:34:44] 2 tRNA-Asp [223,299] 35 (gtc)
[16:34:44] 1 tRNA-Ser [20115,20205] 35 (cga)
[16:34:44] 2 tRNA-Thr c[108431,108506] 34 (cgt)
[16:34:44] 1 tmRNA [65777,66163] 91,123 ANDERFALAA*
[16:34:44] 1 tRNA-Ser c[6269,6359] 35 (gga)
[16:34:44] 1 tRNA-Arg [31943,32018] 34 (ccg)
[16:34:44] 2 tRNA-Met c[32532,32608] 35 (cat)
[16:34:44] 1 tRNA-Gln c[30163,30239] 35 (ttg)
[16:34:44] 2 tRNA-Met [41759,41835] 35 (cat)
[16:34:44] 3 tRNA-Arg [41895,41971] 35 (acg)
[16:34:44] 4 tRNA-Ser c[55396,55485] 35 (tga)
[16:34:44] 1 tRNA-Gly [27395,27468] 33 (ccc)
[16:34:44] 1 tRNA-Phe [7523,7598] 34 (gaa)
[16:34:44] 2 tRNA-Gly [7643,7718] 34 (gcc)
[16:34:44] 1 tRNA-Val [20761,20835] 33 (cac)
[16:34:44] 1 tRNA-Tyr [24455,24540] 35 (gta)
[16:34:44] 2 tRNA-Gly [24620,24693] 33 (tcc)
[16:34:44] 3 tRNA-Thr [24769,24843] 33 (ggt)
[16:34:44] Found 45 tRNAs
[16:34:44] Predicting Ribosomal RNAs
[16:34:44] Running Barrnap with 1 threads
[16:34:47] Found 0 rRNAs
[16:34:47] Skipping ncRNA search, enable with --rfam if desired.
[16:34:47] Total of 44 tRNA + rRNA features
[16:34:47] Searching for CRISPR repeats
[16:34:48] Found 0 CRISPRs
[16:34:48] Predicting coding sequences
[16:34:48] Contigs total 4425020 bp, so using single mode
[16:34:48] Running: prodigal -i data\/annotation\/bin1\/PROKKA_03222022\.fna -c -m -g 11 -p single -f sco -q
[16:34:53] Excluding CDS which overlaps existing RNA (tRNA) at gnl|X|HFHIMAGJ_1:377899..379779 on + strand
[16:34:54] Excluding CDS which overlaps existing RNA (tRNA) at gnl|X|HFHIMAGJ_2:164142..165458 on - strand
[16:34:54] Excluding CDS which overlaps existing RNA (tRNA) at gnl|X|HFHIMAGJ_3:13016..15067 on - strand
[16:34:54] Excluding CDS which overlaps existing RNA (tRNA) at gnl|X|HFHIMAGJ_3:131703..133025 on + strand
[16:34:57] Found 4294 CDS
[16:34:57] Connecting features back to sequences
[16:34:57] Not using genus-specific database. Try --usegenus to enable it.
[16:34:57] Annotating CDS, please be patient.
[16:34:57] Will use 1 CPUs for similarity searching.
[16:34:58] There are still 4294 unannotated CDS left (started with 4294)
[16:34:58] Will use blast to search against /opt/miniconda3/envs/metagenomics_practical/bin/../db/kingdom/Bacteria/sprot with 1 CPUs
[16:34:58] Running: cat data\/annotation\/bin1\/sprot\.faa | parallel --gnu --plain -j 1 --block 696633 --recstart '>' --pipe blastp -query - -db /opt/miniconda3/envs/metagenomics_practical/bin/../db/kingdom/Bacteria/sprot -evalue 1e-06 -num_threads 1 -num_descriptions 1 -num_alignments 1 -seg no > data\/annotation\/bin1\/sprot\.blast 2> /dev/null
[16:39:41] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:41] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:41] Modify product: Uncharacterized protein HI_1163 => putative protein
[16:39:41] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:41] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:41] Modify product: Uncharacterized protein YisK => putative protein YisK
[16:39:41] Modify product: Putative universal stress protein SA1532 => Putative universal stress protein
[16:39:42] Modify product: UPF0394 inner membrane protein YeeE => hypothetical protein
[16:39:42] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:42] Modify product: Uncharacterized membrane protein Rv2637 => putative membrane protein
[16:39:42] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:42] Modify product: Uncharacterized protein YciH => putative protein YciH
[16:39:42] Modify product: Uncharacterized MFS-type transporter YhjX => putative MFS-type transporter YhjX
[16:39:42] Modify product: Uncharacterized NTE family protein Rv1063c => putative NTE family protein
[16:39:42] Modify product: Uncharacterized oxidoreductase SA2266 => putative oxidoreductase
[16:39:42] Modify product: RNA-binding protein HI_1333 => RNA-binding protein
[16:39:42] Modify product: Phosphate-specific transport system accessory protein PhoU homolog => Phosphate-specific transport system accessory protein PhoU 
[16:39:42] Modify product: Uncharacterized oxidoreductase YdgJ => putative oxidoreductase YdgJ
[16:39:42] Modify product: UPF0394 inner membrane protein YeeE => hypothetical protein
[16:39:42] Modify product: Uncharacterized N-acetyltransferase YsnE => putative N-acetyltransferase YsnE
[16:39:42] Modify product: Probable Fe(2+)-trafficking protein => putative Fe(2+)-trafficking protein
[16:39:42] Modify product: Uncharacterized HTH-type transcriptional regulator HI_0186 => putative HTH-type transcriptional regulator
[16:39:42] Modify product: Uncharacterized oxidoreductase SA2266 => putative oxidoreductase
[16:39:42] Modify product: Putative HTH-type transcriptional regulator Rv0890c => Putative HTH-type transcriptional regulator
[16:39:42] Modify product: Putative HTH-type transcriptional regulator Rv0890c => Putative HTH-type transcriptional regulator
[16:39:42] Modify product: Leu/Ile/Val-binding protein homolog 3 => Leu/Ile/Val-binding protein 
[16:39:42] Modify product: Uncharacterized protein PA3435 => putative protein
[16:39:42] Modify product: Probable L-asparaginase => putative L-asparaginase
[16:39:42] Modify product: UPF0434 protein NMA0874 => hypothetical protein
[16:39:42] Modify product: Probable sulfite/organosulfonate exporter TauE => putative sulfite/organosulfonate exporter TauE
[16:39:42] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:42] Modify product: Uncharacterized protein YisK => putative protein YisK
[16:39:42] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:42] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:42] Modify product: Uncharacterized oxidoreductase SA2266 => putative oxidoreductase
[16:39:42] Modify product: Probable transcriptional regulatory protein aq_1575 => putative transcriptional regulatory protein
[16:39:42] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:42] Modify product: UPF0311 protein RPA1785 => hypothetical protein
[16:39:42] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:42] Modify product: Probable formate transporter 1 => putative formate transporter 1
[16:39:42] Modify product: Stringent starvation protein A homolog => Stringent starvation protein A 
[16:39:42] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:42] Modify product: Purine-binding protein BAB2_0673 => Purine-binding protein
[16:39:42] Modify product: Uncharacterized ABC transporter ATP-binding protein YheS => putative ABC transporter ATP-binding protein YheS
[16:39:42] Modify product: Uncharacterized signaling protein PA1727 => putative signaling protein
[16:39:42] Modify product: Putative multidrug export ATP-binding/permease protein SA1683 => Putative multidrug export ATP-binding/permease protein
[16:39:42] Modify product: Probable transcriptional regulatory protein pdtaR => putative transcriptional regulatory protein pdtaR
[16:39:42] Modify product: Uncharacterized protein YbiB => putative protein YbiB
[16:39:42] Modify product: Uncharacterized protein HI_0933 => putative protein
[16:39:42] Modify product: Uncharacterized protein YqeY => putative protein YqeY
[16:39:42] Modify product: UPF0307 protein PSPTO_4464 => hypothetical protein
[16:39:42] Modify product: Acetyltransferase Atu2258 => Acetyltransferase
[16:39:42] Modify product: UPF0073 inner membrane protein YqfA => hypothetical protein
[16:39:42] Modify product: Uncharacterized ABC transporter ATP-binding protein YadG => putative ABC transporter ATP-binding protein YadG
[16:39:42] Modify product: Probable diguanylate cyclase DgcE => putative diguanylate cyclase DgcE
[16:39:42] Modify product: Uncharacterized protein Rv1364c => putative protein
[16:39:42] Modify product: Putative short-chain type dehydrogenase/reductase Rv0148 => Putative short-chain type dehydrogenase/reductase
[16:39:42] Modify product: Probable xylitol oxidase => putative xylitol oxidase
[16:39:42] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:42] Modify product: UPF0353 protein Rv1481 => hypothetical protein
[16:39:42] Modify product: Uncharacterized protein Rv2307c => putative protein
[16:39:42] Modify product: DNA-binding protein VF_0530 => DNA-binding protein
[16:39:42] Modify product: Uncharacterized protein YqjD => putative protein YqjD
[16:39:42] Modify product: Uncharacterized protein Rv1367c => putative protein
[16:39:42] Modify product: NADH dehydrogenase-like protein SA0802 => NADH dehydrogenase-like protein
[16:39:42] Modify product: Cyclic nucleotide-gated potassium channel mll3241 => Cyclic nucleotide-gated potassium channel
[16:39:42] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:42] Modify product: Probable glutamine ABC transporter permease protein GlnM => putative glutamine ABC transporter permease protein GlnM
[16:39:42] Modify product: Probable fructoselysine utilization operon transcriptional repressor => putative fructoselysine utilization operon transcriptional repressor
[16:39:43] Modify product: UPF0053 inner membrane protein YgdQ => hypothetical protein
[16:39:43] Modify product: Uncharacterized lipoprotein YfhM => putative lipoprotein YfhM
[16:39:43] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:43] Modify product: Probable D,D-dipeptide transport ATP-binding protein DdpF => putative D,D-dipeptide transport ATP-binding protein DdpF
[16:39:43] Modify product: Probable D,D-dipeptide transport system permease protein DdpC => putative D,D-dipeptide transport system permease protein DdpC
[16:39:43] Modify product: UPF0246 protein YaaA => hypothetical protein
[16:39:43] Modify product: Probable lipoprotein YiaD => putative lipoprotein YiaD
[16:39:43] Modify product: Probable D,D-dipeptide transport ATP-binding protein DdpF => putative D,D-dipeptide transport ATP-binding protein DdpF
[16:39:43] Modify product: Monocarboxylate 2-oxoacid-binding periplasmic protein all3028 => Monocarboxylate 2-oxoacid-binding periplasmic protein
[16:39:43] Modify product: Probable phosphite transport system-binding protein HtxB => putative phosphite transport system-binding protein HtxB
[16:39:43] Modify product: Probable GTPase Rv1496 => putative GTPase
[16:39:43] Modify product: RutC family protein HI_0719 => RutC family protein
[16:39:43] Modify product: Probable enoyl-CoA hydratase echA12 => putative enoyl-CoA hydratase echA12
[16:39:43] Modify product: Putative quercetin 2,3-dioxygenase Rv0181c => Putative quercetin 2,3-dioxygenase
[16:39:43] Modify product: Probable hydrolase YcaC => putative hydrolase YcaC
[16:39:43] Modify product: Putative HTH-type transcriptional regulator Rv0890c => Putative HTH-type transcriptional regulator
[16:39:43] Modify product: Putative HTH-type transcriptional regulator Rv0890c => Putative HTH-type transcriptional regulator
[16:39:43] Modify product: Putative HTH-type transcriptional regulator Rv0890c => Putative HTH-type transcriptional regulator
[16:39:43] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:43] Modify product: Probable 5-dehydro-4-deoxyglucarate dehydratase => putative 5-dehydro-4-deoxyglucarate dehydratase
[16:39:43] Modify product: Uncharacterized protein YphB => putative protein YphB
[16:39:43] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:43] Modify product: Probable D-xylose utilization operon transcriptional repressor => putative D-xylose utilization operon transcriptional repressor
[16:39:43] Modify product: Probable lipoprotein YiaD => putative lipoprotein YiaD
[16:39:43] Modify product: Solute-binding protein Dde_0634 => Solute-binding protein
[16:39:43] Modify product: Uncharacterized inner membrane transporter YedA => putative inner membrane transporter YedA
[16:39:43] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:43] Modify product: UPF0394 membrane protein XF_0765 => hypothetical protein
[16:39:43] Modify product: UPF0394 membrane protein XF_0766 => hypothetical protein
[16:39:43] Modify product: Probable glutamine ABC transporter permease protein GlnM => putative glutamine ABC transporter permease protein GlnM
[16:39:43] Modify product: Uncharacterized FAD-linked oxidoreductase Rv2280 => putative FAD-linked oxidoreductase
[16:39:43] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:43] Modify product: Probable TonB-dependent receptor BfrD => putative TonB-dependent receptor BfrD
[16:39:43] Modify product: PKHD-type hydroxylase Sbal_3634 => PKHD-type hydroxylase
[16:39:43] Modify product: UPF0337 protein YjbJ => hypothetical protein
[16:39:43] Modify product: Purine-binding protein BAB2_0673 => Purine-binding protein
[16:39:43] Modify product: Uncharacterized NTE family protein Rv1063c => putative NTE family protein
[16:39:43] Modify product: Uncharacterized oxidoreductase Rv1144 => putative oxidoreductase
[16:39:43] Modify product: Probable glutamine ABC transporter permease protein GlnM => putative glutamine ABC transporter permease protein GlnM
[16:39:43] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:43] Modify product: Monocarboxylate 2-oxoacid-binding periplasmic protein all3028 => Monocarboxylate 2-oxoacid-binding periplasmic protein
[16:39:43] Modify product: Uncharacterized protein YibN => putative protein YibN
[16:39:43] Modify product: Probable CtpA-like serine protease => putative CtpA-like serine protease
[16:39:43] Modify product: Uncharacterized protein Rv2568c => putative protein
[16:39:43] Modify product: Probable sulfate transporter Rv1739c => putative sulfate transporter
[16:39:43] Modify product: UPF0235 protein YggU => hypothetical protein
[16:39:43] Modify product: Probable protein kinase UbiB => putative protein kinase UbiB
[16:39:43] Modify product: Uncharacterized FAD-linked oxidoreductase Rv2280 => putative FAD-linked oxidoreductase
[16:39:43] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:43] Modify product: UPF0502 protein YceH => hypothetical protein
[16:39:43] Modify product: Uncharacterized metal-dependent hydrolase YjjV => putative metal-dependent hydrolase YjjV
[16:39:43] Modify product: Uncharacterized protein Rv1480 => putative protein
[16:39:43] Modify product: UPF0353 protein Rv1481 => hypothetical protein
[16:39:43] Modify product: Purine-binding protein BAB2_0673 => Purine-binding protein
[16:39:43] Modify product: Purine-binding protein BAB2_0673 => Purine-binding protein
[16:39:43] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:43] Modify product: Uncharacterized protein HI_1028 => putative protein
[16:39:43] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:43] Modify product: Putative thiamine biosynthesis protein HI_0357 => Putative thiamine biosynthesis protein
[16:39:43] Modify product: Probable enoyl-CoA hydratase 1 => putative enoyl-CoA hydratase 1
[16:39:43] Modify product: NAD(P)H-dependent FMN reductase PA1204 => NAD(P)H-dependent FMN reductase
[16:39:43] Modify product: UPF0301 protein SO_3346 => hypothetical protein
[16:39:43] Modify product: Probable tRNA-dihydrouridine synthase => putative tRNA-dihydrouridine synthase
[16:39:43] Modify product: Probable diguanylate cyclase DgcC => putative diguanylate cyclase DgcC
[16:39:43] Modify product: Uncharacterized protein YcbX => putative protein YcbX
[16:39:43] Modify product: Probable thiol:disulfide interchange protein DsbC => putative thiol:disulfide interchange protein DsbC
[16:39:43] Modify product: Probable enoyl-CoA hydratase echA8 => putative enoyl-CoA hydratase echA8
[16:39:43] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:43] Modify product: Probable FKBP-type peptidyl-prolyl cis-trans isomerase => putative FKBP-type peptidyl-prolyl cis-trans isomerase
[16:39:43] Modify product: Uncharacterized protein YisK => putative protein YisK
[16:39:43] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:43] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:44] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:44] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:44] Modify product: Probable fatty acid methyltransferase Rv3720 => putative fatty acid methyltransferase
[16:39:44] Modify product: Uncharacterized protein HI_0131 => putative protein
[16:39:44] Modify product: Probable type II secretion system protein HxcR => putative type II secretion system protein HxcR
[16:39:44] Modify product: Uncharacterized protein HI_0131 => putative protein
[16:39:44] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:44] Modify product: Uncharacterized oxidoreductase YdgJ => putative oxidoreductase YdgJ
[16:39:44] Modify product: Uncharacterized protein Rv0906 => putative protein
[16:39:44] Modify product: UPF0250 protein YbeD => hypothetical protein
[16:39:44] Modify product: Uncharacterized protein HI_0131 => putative protein
[16:39:44] Modify product: Probable Ni/Fe-hydrogenase 1 B-type cytochrome subunit => putative Ni/Fe-hydrogenase 1 B-type cytochrome subunit
[16:39:44] Modify product: Peroxiredoxin sll1621 => Peroxiredoxin
[16:39:44] Modify product: Uncharacterized GST-like protein YibF => putative GST-like protein YibF
[16:39:44] Modify product: Uncharacterized protein YaeQ => putative protein YaeQ
[16:39:44] Modify product: UPF0053 inner membrane protein YoaE => hypothetical protein
[16:39:44] Modify product: Uncharacterized glycosyltransferase Rv0539 => putative glycosyltransferase
[16:39:44] Modify product: UPF0053 inner membrane protein YoaE => hypothetical protein
[16:39:44] Modify product: Uncharacterized protein YisK => putative protein YisK
[16:39:44] Modify product: Probable enoyl-CoA hydratase echA12 => putative enoyl-CoA hydratase echA12
[16:39:44] Modify product: Uncharacterized protein Rv0628c => putative protein
[16:39:44] Modify product: Purine-binding protein BAB2_0673 => Purine-binding protein
[16:39:44] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:44] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:44] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:44] Modify product: Putative glutamine amidotransferase Rv2859c => Putative glutamine amidotransferase
[16:39:44] Modify product: Uncharacterized protein HI_0131 => putative protein
[16:39:44] Modify product: Probable D-xylose utilization operon transcriptional repressor => putative D-xylose utilization operon transcriptional repressor
[16:39:44] Modify product: Purine-binding protein BAB2_0673 => Purine-binding protein
[16:39:44] Modify product: Putative quercetin 2,3-dioxygenase Rv0181c => Putative quercetin 2,3-dioxygenase
[16:39:44] Modify product: UPF0702 transmembrane protein YetF => hypothetical protein
[16:39:44] Modify product: Uncharacterized oxidoreductase Rv0945 => putative oxidoreductase
[16:39:44] Modify product: Probable GTP-binding protein EngB => putative GTP-binding protein EngB
[16:39:44] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:44] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:44] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:44] Modify product: Leu/Ile/Val-binding protein homolog 3 => Leu/Ile/Val-binding protein 
[16:39:44] Modify product: Leu/Ile/Val-binding protein homolog 3 => Leu/Ile/Val-binding protein 
[16:39:44] Modify product: Uncharacterized ABC transporter ATP-binding protein Rv2564 => putative ABC transporter ATP-binding protein
[16:39:44] Modify product: Uncharacterized protein YpbG => putative protein YpbG
[16:39:44] Modify product: UPF0225 protein YchJ => hypothetical protein
[16:39:44] Modify product: Uncharacterized GTP-binding protein YjiA => putative GTP-binding protein YjiA
[16:39:44] Modify product: Uncharacterized signaling protein PA1727 => putative signaling protein
[16:39:44] Modify product: Probable amino-acid permease protein YxeN => putative amino-acid permease protein YxeN
[16:39:44] Modify product: Probable xanthine dehydrogenase subunit C => putative xanthine dehydrogenase subunit C
[16:39:44] Modify product: Probable xanthine dehydrogenase subunit D => putative xanthine dehydrogenase subunit D
[16:39:44] Modify product: Probable xanthine dehydrogenase subunit E => putative xanthine dehydrogenase subunit E
[16:39:44] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:44] Modify product: Uncharacterized acyl-CoA thioester hydrolase HI_0827 => putative acyl-CoA thioester hydrolase
[16:39:44] Modify product: UPF0213 protein BH0048 => hypothetical protein
[16:39:44] Modify product: Uncharacterized HTH-type transcriptional regulator YybR => putative HTH-type transcriptional regulator YybR
[16:39:44] Modify product: UPF0176 protein lpg2838 => hypothetical protein
[16:39:44] Modify product: Probable phosphite transport system-binding protein PtxB => putative phosphite transport system-binding protein PtxB
[16:39:44] Modify product: Probable 2,4-dienoyl-CoA reductase => putative 2,4-dienoyl-CoA reductase
[16:39:44] Modify product: Probable ECF RNA polymerase sigma factor SigI => putative ECF RNA polymerase sigma factor SigI
[16:39:44] Modify product: UPF0126 inner membrane protein YicG => hypothetical protein
[16:39:44] Modify product: Uncharacterized NAD-dependent oxidoreductase MAP_4146 => putative NAD-dependent oxidoreductase
[16:39:44] Modify product: Probable 3-hydroxyacyl-CoA dehydrogenase => putative 3-hydroxyacyl-CoA dehydrogenase
[16:39:44] Modify product: Probable undecaprenyl-phosphate N-acetylglucosaminyl 1-phosphate transferase => putative undecaprenyl-phosphate N-acetylglucosaminyl 1-phosphate transferase
[16:39:44] Modify product: Probable S-adenosylmethionine-dependent methyltransferase MSMEG_2350/MSMEI_2290 => putative S-adenosylmethionine-dependent methyltransferase/MSMEI_2290
[16:39:44] Modify product: Uncharacterized protein RP789 => putative protein RP789
[16:39:44] Modify product: Probable diacyglycerol O-acyltransferase tgs1 => putative diacyglycerol O-acyltransferase tgs1
[16:39:44] Modify product: Stringent starvation protein B homolog => Stringent starvation protein B 
[16:39:44] Modify product: Leu/Ile/Val-binding protein homolog 3 => Leu/Ile/Val-binding protein 
[16:39:44] Modify product: Probable enoyl-CoA hydratase echA8 => putative enoyl-CoA hydratase echA8
[16:39:44] Modify product: Probable bifunctional transcriptional activator/DNA repair enzyme AlkA => putative bifunctional transcriptional activator/DNA repair enzyme AlkA
[16:39:44] Modify product: Sulfurtransferase Alvin_2599 => Sulfurtransferase
[16:39:44] Modify product: Probable TonB-dependent receptor NMB0964 => putative TonB-dependent receptor
[16:39:44] Modify product: Uncharacterized protein YyaP => putative protein YyaP
[16:39:44] Modify product: Probable hydrolase YcaC => putative hydrolase YcaC
[16:39:44] Modify product: Probable copper-importing P-type ATPase A => putative copper-importing P-type ATPase A
[16:39:44] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:45] Modify product: Uncharacterized FAD-linked oxidoreductase Rv2280 => putative FAD-linked oxidoreductase
[16:39:45] Modify product: Uncharacterized MFS-type transporter YhjX => putative MFS-type transporter YhjX
[16:39:45] Modify product: Probable 2-dehydro-3-deoxygalactonokinase DgoK1 => putative 2-dehydro-3-deoxygalactonokinase DgoK1
[16:39:45] Modify product: Probable D,D-dipeptide transport system permease protein DdpC => putative D,D-dipeptide transport system permease protein DdpC
[16:39:45] Modify product: Solute-binding protein Veis_3954 => Solute-binding protein
[16:39:45] Modify product: Uncharacterized aminopeptidase Rv1333 => putative aminopeptidase
[16:39:45] Modify product: Probable copper-importing P-type ATPase A => putative copper-importing P-type ATPase A
[16:39:45] Modify product: Uncharacterized protein Rv0906 => putative protein
[16:39:45] Modify product: Protease HtpX homolog => Protease HtpX 
[16:39:45] Modify product: Probable DNA endonuclease SmrA => putative DNA endonuclease SmrA
[16:39:45] Modify product: UPF0758 protein YsxA => hypothetical protein
[16:39:45] Modify product: Uncharacterized protein YciO => putative protein YciO
[16:39:45] Modify product: Uncharacterized protein Rv2575 => putative protein
[16:39:45] Modify product: Chemotaxis protein CheY homolog => Chemotaxis protein CheY 
[16:39:45] Modify product: UPF0053 inner membrane protein YtfL => hypothetical protein
[16:39:45] Modify product: Uncharacterized protein Rv2715 => putative protein
[16:39:45] Modify product: Probable glycerol-3-phosphate acyltransferase => putative glycerol-3-phosphate acyltransferase
[16:39:45] Modify product: UPF0234 protein HI_1034 => hypothetical protein
[16:39:45] Modify product: Uncharacterized ABC transporter ATP-binding protein TM_0288 => putative ABC transporter ATP-binding protein
[16:39:45] Modify product: Uncharacterized MFS-type transporter YfcJ => putative MFS-type transporter YfcJ
[16:39:45] Modify product: Uncharacterized protein Rv2569c => putative protein
[16:39:45] Modify product: UPF0153 protein YeiW => hypothetical protein
[16:39:45] Modify product: Probable chromosome-partitioning protein ParB => putative chromosome-partitioning protein ParB
[16:39:45] Modify product: Uncharacterized FAD-linked oxidoreductase Rv2280 => putative FAD-linked oxidoreductase
[16:39:45] Modify product: Putative esterase HI_1161 => Putative esterase
[16:39:45] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:45] Modify product: Outer membrane porin protein BP0840 => Outer membrane porin protein
[16:39:45] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:45] Modify product: UPF0173 metal-dependent hydrolase SA1529 => hypothetical protein
[16:39:45] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:45] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:45] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:45] Modify product: Probable tartrate dehydrogenase/decarboxylase TtuC' => putative tartrate dehydrogenase/decarboxylase TtuC'
[16:39:45] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:45] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:45] Modify product: Uncharacterized HTH-type transcriptional regulator Rv1985c => putative HTH-type transcriptional regulator
[16:39:45] Modify product: Putative phosphoribosyl transferase MT0597 => Putative phosphoribosyl transferase
[16:39:45] Modify product: Uncharacterized protein MT2089 => putative protein
[16:39:45] Modify product: Cyclic di-GMP phosphodiesterase PA4781 => Cyclic di-GMP phosphodiesterase
[16:39:45] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:45] Modify product: Uncharacterized NTE family protein Rv1063c => putative NTE family protein
[16:39:45] Modify product: Uncharacterized NTE family protein Rv2565 => putative NTE family protein
[16:39:45] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:45] Modify product: Uncharacterized protein YbaA => putative protein YbaA
[16:39:45] Modify product: Putative acyltransferase Rv0859 => Putative acyltransferase
[16:39:45] Modify product: UPF0070 protein HI_0370 => hypothetical protein
[16:39:45] Modify product: Segregation and condensation protein B homolog => Segregation and condensation protein B 
[16:39:45] Modify product: Probable ATP-dependent helicase DinG => putative ATP-dependent helicase DinG
[16:39:45] Modify product: Uncharacterized protein Rv2569c => putative protein
[16:39:45] Modify product: Uncharacterized protein Rv2567 => putative protein
[16:39:45] Modify product: Uncharacterized protein Rv2569c => putative protein
[16:39:45] Modify product: Uncharacterized oxidoreductase SA2266 => putative oxidoreductase
[16:39:45] Modify product: Uncharacterized protein NMB0928 => putative protein
[16:39:45] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:45] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:45] Modify product: Uncharacterized protein HI_1048 => putative protein
[16:39:45] Modify product: Uncharacterized protein Rv2307c => putative protein
[16:39:46] Modify product: Uncharacterized ABC transporter ATP-binding protein Rv0986 => putative ABC transporter ATP-binding protein
[16:39:46] Modify product: Probable type II secretion system protein HxcR => putative type II secretion system protein HxcR
[16:39:46] Modify product: Putative transport protein Rv0205 => Putative transport protein
[16:39:46] Modify product: Probable phosphatase Rv3661 => putative phosphatase
[16:39:46] Modify product: Probable multidrug-efflux transporter Rv1258c => putative multidrug-efflux transporter
[16:39:46] Modify product: Probable xanthine dehydrogenase subunit A => putative xanthine dehydrogenase subunit A
[16:39:46] Modify product: UPF0337 protein PA4738 => hypothetical protein
[16:39:46] Modify product: Solute-binding protein Bamb_6123 => Solute-binding protein
[16:39:46] Modify product: Solute-binding protein Rfer_1840 => Solute-binding protein
[16:39:46] Modify product: Uncharacterized FAD-linked oxidoreductase Rv2280 => putative FAD-linked oxidoreductase
[16:39:46] Modify product: UPF0761 membrane protein YihY => hypothetical protein
[16:39:46] Modify product: Probable 3-phenylpropionic acid transporter => putative 3-phenylpropionic acid transporter
[16:39:46] Modify product: Solute-binding protein Veis_3954 => Solute-binding protein
[16:39:46] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:46] Modify product: Probable D-xylose utilization operon transcriptional repressor => putative D-xylose utilization operon transcriptional repressor
[16:39:46] Modify product: Stringent starvation protein A homolog => Stringent starvation protein A 
[16:39:46] Modify product: Stress response UPF0229 protein YhbH => hypothetical protein
[16:39:46] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:46] Modify product: Probable 5-dehydro-4-deoxyglucarate dehydratase => putative 5-dehydro-4-deoxyglucarate dehydratase
[16:39:46] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:46] Modify product: Solute-binding protein Bpro_3107 => Solute-binding protein
[16:39:46] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:46] Modify product: Uncharacterized protein HI_0131 => putative protein
[16:39:46] Modify product: Uncharacterized protein HI_0131 => putative protein
[16:39:46] Modify product: Uncharacterized oxidoreductase Rv1144 => putative oxidoreductase
[16:39:46] Modify product: Leu/Ile/Val-binding protein homolog 3 => Leu/Ile/Val-binding protein 
[16:39:46] Modify product: Putative monooxygenase Rv1533 => Putative monooxygenase
[16:39:46] Modify product: Uncharacterized protein Rv2411c => putative protein
[16:39:46] Modify product: Uncharacterized protein PA3922 => putative protein
[16:39:46] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:46] Modify product: UPF0324 inner membrane protein YeiH => hypothetical protein
[16:39:46] Modify product: Probable L-galactonate transporter => putative L-galactonate transporter
[16:39:46] Modify product: Uncharacterized ABC transporter ATP-binding protein YejF => putative ABC transporter ATP-binding protein YejF
[16:39:46] Modify product: Uncharacterized metal-dependent hydrolase YcfH => putative metal-dependent hydrolase YcfH
[16:39:46] Modify product: Acetyltransferase SACOL1063 => Acetyltransferase
[16:39:46] Modify product: Probable intracellular septation protein A => putative intracellular septation protein A
[16:39:46] Modify product: Probable parvulin-type peptidyl-prolyl cis-trans isomerase => putative parvulin-type peptidyl-prolyl cis-trans isomerase
[16:39:46] Modify product: Uncharacterized oxidoreductase YgfF => putative oxidoreductase YgfF
[16:39:46] Modify product: UPF0056 inner membrane protein MarC => hypothetical protein
[16:39:46] Modify product: Probable phospholipid import ATP-binding protein MlaF => putative phospholipid import ATP-binding protein MlaF
[16:39:46] Modify product: Probable phospholipid ABC transporter permease protein MlaE => putative phospholipid ABC transporter permease protein MlaE
[16:39:46] Modify product: Probable membrane transporter protein YfcA => putative membrane transporter protein YfcA
[16:39:46] Modify product: Probable deferrochelatase/peroxidase YfeX => putative deferrochelatase/peroxidase YfeX
[16:39:46] Modify product: Probable multidrug resistance protein EmrY => putative multidrug resistance protein EmrY
[16:39:46] Modify product: Leu/Ile/Val-binding protein homolog 3 => Leu/Ile/Val-binding protein 
[16:39:46] Modify product: Probable D,D-dipeptide transport ATP-binding protein DdpF => putative D,D-dipeptide transport ATP-binding protein DdpF
[16:39:46] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:46] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:46] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:46] Modify product: Probable cystine transporter YijE => putative cystine transporter YijE
[16:39:46] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:46] Modify product: Uncharacterized FAD-linked oxidoreductase Rv2280 => putative FAD-linked oxidoreductase
[16:39:46] Modify product: Uncharacterized protein YphB => putative protein YphB
[16:39:46] Modify product: SCO1 protein homolog => SCO1 protein 
[16:39:46] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:46] Modify product: Probable diguanylate cyclase DgcC => putative diguanylate cyclase DgcC
[16:39:46] Modify product: Uncharacterized zinc protease Rv2782c => putative zinc protease
[16:39:46] Modify product: Uncharacterized zinc protease Rv2782c => putative zinc protease
[16:39:46] Modify product: Uncharacterized ABC transporter ATP-binding protein Rv1281c => putative ABC transporter ATP-binding protein
[16:39:46] Modify product: UPF0701 protein HI_0467 => hypothetical protein
[16:39:46] Modify product: Cyclic di-GMP phosphodiesterase PA4781 => Cyclic di-GMP phosphodiesterase
[16:39:46] Modify product: UPF0176 protein lpg2838 => hypothetical protein
[16:39:46] Modify product: Putative quercetin 2,3-dioxygenase Rv0181c => Putative quercetin 2,3-dioxygenase
[16:39:46] Modify product: UPF0098 protein CT_736 => hypothetical protein
[16:39:46] Modify product: Putative multidrug export ATP-binding/permease protein SA1683 => Putative multidrug export ATP-binding/permease protein
[16:39:46] Modify product: Probable succinyl-CoA:3-ketoacid coenzyme A transferase subunit B => putative succinyl-CoA:3-ketoacid coenzyme A transferase subunit B
[16:39:46] Modify product: Probable succinyl-CoA:3-ketoacid coenzyme A transferase subunit A => putative succinyl-CoA:3-ketoacid coenzyme A transferase subunit A
[16:39:47] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:47] Modify product: Uncharacterized protein Rv2913c => putative protein
[16:39:47] Modify product: Uncharacterized GMC-type oxidoreductase Rv0492c => putative GMC-type oxidoreductase
[16:39:47] Modify product: Probable dual-specificity RNA methyltransferase RlmN => putative dual-specificity RNA methyltransferase RlmN
[16:39:47] Modify product: Co-chaperone protein HscB homolog => Co-chaperone protein HscB 
[16:39:47] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:47] Modify product: Bifunctional protein FolD => Bifunctional protein FolD protein
[16:39:47] Modify product: Uncharacterized protein MT3212 => putative protein
[16:39:47] Modify product: Probable hydrolase YcaC => putative hydrolase YcaC
[16:39:47] Modify product: Uncharacterized protein Rv2568c => putative protein
[16:39:47] Modify product: UPF0065 protein in clcB-clcD intergenic region => hypothetical protein
[16:39:47] Modify product: Uncharacterized ABC transporter ATP-binding protein YadG => putative ABC transporter ATP-binding protein YadG
[16:39:47] Modify product: Probable phospholipid-binding protein MlaC => putative phospholipid-binding protein MlaC
[16:39:47] Modify product: Probable phospholipid-binding lipoprotein MlaA => putative phospholipid-binding lipoprotein MlaA
[16:39:47] Modify product: Probable phospholipid ABC transporter-binding protein MlaD => putative phospholipid ABC transporter-binding protein MlaD
[16:39:47] Modify product: Probable phospholipid ABC transporter permease protein MlaE => putative phospholipid ABC transporter permease protein MlaE
[16:39:47] Modify product: Probable phospholipid import ATP-binding protein MlaF => putative phospholipid import ATP-binding protein MlaF
[16:39:47] Modify product: Probable peptidoglycan D,D-transpeptidase PenA => putative peptidoglycan D,D-transpeptidase PenA
[16:39:47] Modify product: Probable peptidoglycan glycosyltransferase FtsW => putative peptidoglycan glycosyltransferase FtsW
[16:39:47] Modify product: Uncharacterized FAD-linked oxidoreductase Rv2280 => putative FAD-linked oxidoreductase
[16:39:47] Modify product: GTP cyclohydrolase 1 type 2 homolog => GTP cyclohydrolase 1 type 2 
[16:39:47] Cleaned 358 /product names
[16:39:47] Deleting unwanted file: data/annotation/bin1/sprot.faa
[16:39:47] Deleting unwanted file: data/annotation/bin1/sprot.blast
[16:39:47] There are still 1430 unannotated CDS left (started with 4294)
[16:39:47] Will use hmmer3 to search against /opt/miniconda3/envs/metagenomics_practical/bin/../db/hmm/HAMAP.hmm with 1 CPUs
[16:39:47] Running: cat data\/annotation\/bin1\/HAMAP\.hmm\.faa | parallel --gnu --plain -j 1 --block 173344 --recstart '>' --pipe hmmscan --noali --notextw --acc -E 1e-06 --cpu 1 /opt/miniconda3/envs/metagenomics_practical/bin/../db/hmm/HAMAP.hmm /dev/stdin > data\/annotation\/bin1\/HAMAP\.hmm\.hmmer3 2> /dev/null
[16:40:27] Modify product: Uncharacterized Nudix hydrolase NudL => putative Nudix hydrolase NudL
[16:40:27] Cleaned 1 /product names
[16:40:27] Deleting unwanted file: data/annotation/bin1/HAMAP.hmm.faa
[16:40:27] Deleting unwanted file: data/annotation/bin1/HAMAP.hmm.hmmer3
[16:40:27] Labelling remaining 1338 proteins as 'hypothetical protein'
[16:40:27] Possible /pseudo '2-isopropylmalate synthase' at gnl|X|HFHIMAGJ_1 position 429143
[16:40:27] Possible /pseudo 'Putative aliphatic sulfonates-binding protein' at gnl|X|HFHIMAGJ_1 position 450886
[16:40:27] Possible /pseudo 'Putative aliphatic sulfonates-binding protein' at gnl|X|HFHIMAGJ_1 position 457008
[16:40:27] Possible /pseudo 'Putative aliphatic sulfonates-binding protein' at gnl|X|HFHIMAGJ_1 position 457987
[16:40:27] Possible /pseudo 'Putative HTH-type transcriptional regulator' at gnl|X|HFHIMAGJ_1 position 490964
[16:40:27] Possible /pseudo 'Riboflavin transport system permease protein RibX' at gnl|X|HFHIMAGJ_1 position 497782
[16:40:27] Possible /pseudo 'Isocitrate dehydrogenase [NADP]' at gnl|X|HFHIMAGJ_1 position 594202
[16:40:27] Possible /pseudo 'Alpha-keto acid-binding periplasmic protein TakP' at gnl|X|HFHIMAGJ_2 position 153845
[16:40:27] Possible /pseudo 'Alpha-keto acid-binding periplasmic protein TakP' at gnl|X|HFHIMAGJ_2 position 155058
[16:40:27] Possible /pseudo 'Putative HTH-type transcriptional regulator' at gnl|X|HFHIMAGJ_3 position 48146
[16:40:27] Possible /pseudo 'Short-chain-enoyl-CoA hydratase' at gnl|X|HFHIMAGJ_5 position 197926
[16:40:27] Possible /pseudo 'Heme/hemopexin-binding protein' at gnl|X|HFHIMAGJ_6 position 52625
[16:40:27] Possible /pseudo 'Lipid A biosynthesis lauroyltransferase' at gnl|X|HFHIMAGJ_7 position 30350
[16:40:27] Possible /pseudo 'L-cystine-binding protein FliY' at gnl|X|HFHIMAGJ_7 position 165912
[16:40:27] Possible /pseudo 'D-methionine-binding lipoprotein MetQ' at gnl|X|HFHIMAGJ_7 position 176506
[16:40:27] Possible /pseudo 'Dibenzothiophene desulfurization enzyme C' at gnl|X|HFHIMAGJ_7 position 180000
[16:40:27] Possible /pseudo 'Leucine-, isoleucine-, valine-, threonine-, and alanine-binding protein' at gnl|X|HFHIMAGJ_8 position 54470
[16:40:27] Possible /pseudo 'Type I secretion system ATP-binding protein PrsD' at gnl|X|HFHIMAGJ_8 position 133595
[16:40:27] Possible /pseudo 'Methylxanthine N1-demethylase NdmA' at gnl|X|HFHIMAGJ_9 position 79193
[16:40:27] Possible /pseudo 'A-type flagellin' at gnl|X|HFHIMAGJ_10 position 71977
[16:40:27] Possible /pseudo 'Cyanophycin synthetase' at gnl|X|HFHIMAGJ_11 position 25117
[16:40:27] Possible /pseudo 'Signal transduction histidine-protein kinase BarA' at gnl|X|HFHIMAGJ_12 position 68081
[16:40:27] Possible /pseudo 'Chromate transport protein' at gnl|X|HFHIMAGJ_12 position 100187
[16:40:27] Possible /pseudo 'putative protein' at gnl|X|HFHIMAGJ_13 position 10031
[16:40:27] Possible /pseudo 'Autoinducer 2 sensor kinase/phosphatase LuxQ' at gnl|X|HFHIMAGJ_16 position 4119
[16:40:27] Possible /pseudo 'CAI-1 autoinducer sensor kinase/phosphatase CqsS' at gnl|X|HFHIMAGJ_16 position 96763
[16:40:27] Possible /pseudo 'Biopolymer transport protein ExbD' at gnl|X|HFHIMAGJ_18 position 8843
[16:40:27] Possible /pseudo 'Homoprotocatechuate catabolism bifunctional isomerase/decarboxylase' at gnl|X|HFHIMAGJ_18 position 45718
[16:40:27] Possible /pseudo 'putative zinc protease' at gnl|X|HFHIMAGJ_19 position 14616
[16:40:27] Possible /pseudo 'Cytochrome c oxidase subunit 2' at gnl|X|HFHIMAGJ_20 position 28494
[16:40:27] Possible /pseudo 'Cytochrome c oxidase subunit 2' at gnl|X|HFHIMAGJ_20 position 28879
[16:40:27] Possible /pseudo 'Methyl-accepting chemotaxis protein I' at gnl|X|HFHIMAGJ_20 position 49736
[16:40:27] Found 1914 unique /gene codes.
[16:40:27] Fixed 6 duplicate /gene - cph1_1 cph1_2 cph1_3 cph1_4 cph1_5 cph1_6
[16:40:27] Fixed 5 duplicate /gene - gsiD_1 gsiD_2 gsiD_3 gsiD_4 gsiD_5
[16:40:27] Fixed 3 duplicate /gene - cpdA_1 cpdA_2 cpdA_3
[16:40:27] Fixed 3 duplicate /gene - paaG_1 paaG_2 paaG_3
[16:40:27] Fixed 3 duplicate /gene - trxA_1 trxA_2 trxA_3
[16:40:27] Fixed 2 duplicate /gene - yadH_1 yadH_2
[16:40:27] Fixed 5 duplicate /gene - ssuC_1 ssuC_2 ssuC_3 ssuC_4 ssuC_5
[16:40:27] Fixed 3 duplicate /gene - arcB_1 arcB_2 arcB_3
[16:40:27] Fixed 2 duplicate /gene - fmt_1 fmt_2
[16:40:27] Fixed 2 duplicate /gene - thiE_1 thiE_2
[16:40:27] Fixed 4 duplicate /gene - soxC_1 soxC_2 soxC_3 soxC_4
[16:40:27] Fixed 4 duplicate /gene - rhlE_1 rhlE_2 rhlE_3 rhlE_4
[16:40:27] Fixed 2 duplicate /gene - hpcE_1 hpcE_2
[16:40:27] Fixed 2 duplicate /gene - rnr_1 rnr_2
[16:40:27] Fixed 2 duplicate /gene - hemN_1 hemN_2
[16:40:27] Fixed 2 duplicate /gene - greA_1 greA_2
[16:40:27] Fixed 5 duplicate /gene - bepA_1 bepA_2 bepA_3 bepA_4 bepA_5
[16:40:27] Fixed 2 duplicate /gene - hexR_1 hexR_2
[16:40:27] Fixed 4 duplicate /gene - hyuA_1 hyuA_2 hyuA_3 hyuA_4
[16:40:27] Fixed 3 duplicate /gene - rutB_1 rutB_2 rutB_3
[16:40:27] Fixed 3 duplicate /gene - prkC_1 prkC_2 prkC_3
[16:40:27] Fixed 2 duplicate /gene - fosB_1 fosB_2
[16:40:27] Fixed 2 duplicate /gene - otnK_1 otnK_2
[16:40:27] Fixed 2 duplicate /gene - hosA_1 hosA_2
[16:40:27] Fixed 2 duplicate /gene - mepM_1 mepM_2
[16:40:27] Fixed 2 duplicate /gene - uvrA_1 uvrA_2
[16:40:27] Fixed 2 duplicate /gene - cheA_1 cheA_2
[16:40:27] Fixed 2 duplicate /gene - loiP_1 loiP_2
[16:40:27] Fixed 4 duplicate /gene - ctaC_1 ctaC_2 ctaC_3 ctaC_4
[16:40:27] Fixed 2 duplicate /gene - rpoE_1 rpoE_2
[16:40:27] Fixed 2 duplicate /gene - tam_1 tam_2
[16:40:27] Fixed 2 duplicate /gene - dhaT_1 dhaT_2
[16:40:27] Fixed 2 duplicate /gene - ltnD_1 ltnD_2
[16:40:27] Fixed 2 duplicate /gene - mrcA_1 mrcA_2
[16:40:27] Fixed 2 duplicate /gene - dnaJ_1 dnaJ_2
[16:40:27] Fixed 3 duplicate /gene - oatA_1 oatA_2 oatA_3
[16:40:27] Fixed 2 duplicate /gene - srkA_1 srkA_2
[16:40:27] Fixed 6 duplicate /gene - cysA_1 cysA_2 cysA_3 cysA_4 cysA_5 cysA_6
[16:40:27] Fixed 2 duplicate /gene - flhA_1 flhA_2
[16:40:27] Fixed 2 duplicate /gene - mobA_1 mobA_2
[16:40:27] Fixed 6 duplicate /gene - exbD_1 exbD_2 exbD_3 exbD_4 exbD_5 exbD_6
[16:40:27] Fixed 4 duplicate /gene - pilT_1 pilT_2 pilT_3 pilT_4
[16:40:27] Fixed 2 duplicate /gene - fliC_1 fliC_2
[16:40:27] Fixed 2 duplicate /gene - dapA_1 dapA_2
[16:40:27] Fixed 2 duplicate /gene - nuoB_1 nuoB_2
[16:40:27] Fixed 2 duplicate /gene - todS_1 todS_2
[16:40:27] Fixed 2 duplicate /gene - slcC_1 slcC_2
[16:40:27] Fixed 2 duplicate /gene - mscM_1 mscM_2
[16:40:27] Fixed 2 duplicate /gene - dcmA_1 dcmA_2
[16:40:27] Fixed 2 duplicate /gene - hpd_1 hpd_2
[16:40:27] Fixed 3 duplicate /gene - nlhH_1 nlhH_2 nlhH_3
[16:40:27] Fixed 2 duplicate /gene - tas_1 tas_2
[16:40:27] Fixed 3 duplicate /gene - iolS_1 iolS_2 iolS_3
[16:40:27] Fixed 2 duplicate /gene - prmA_1 prmA_2
[16:40:27] Fixed 2 duplicate /gene - cckA_1 cckA_2
[16:40:27] Fixed 2 duplicate /gene - bcr_1 bcr_2
[16:40:27] Fixed 2 duplicate /gene - ompR_1 ompR_2
[16:40:27] Fixed 6 duplicate /gene - dctM_1 dctM_2 dctM_3 dctM_4 dctM_5 dctM_6
[16:40:27] Fixed 8 duplicate /gene - crp_1 crp_2 crp_3 crp_4 crp_5 crp_6 crp_7 crp_8
[16:40:27] Fixed 2 duplicate /gene - liaR_1 liaR_2
[16:40:27] Fixed 5 duplicate /gene - cmpD_1 cmpD_2 cmpD_3 cmpD_4 cmpD_5
[16:40:27] Fixed 2 duplicate /gene - grxC_1 grxC_2
[16:40:27] Fixed 2 duplicate /gene - kefC_1 kefC_2
[16:40:27] Fixed 2 duplicate /gene - rsmI_1 rsmI_2
[16:40:27] Fixed 3 duplicate /gene - trxB_1 trxB_2 trxB_3
[16:40:27] Fixed 2 duplicate /gene - stkP_1 stkP_2
[16:40:27] Fixed 2 duplicate /gene - cmpR_1 cmpR_2
[16:40:27] Fixed 10 duplicate /gene - rspR_1 rspR_2 rspR_3 rspR_4 rspR_5 rspR_6 rspR_7 rspR_8 rspR_9 rspR_10
[16:40:27] Fixed 2 duplicate /gene - kup_1 kup_2
[16:40:27] Fixed 2 duplicate /gene - metXA_1 metXA_2
[16:40:27] Fixed 2 duplicate /gene - araQ_1 araQ_2
[16:40:27] Fixed 2 duplicate /gene - egtB_1 egtB_2
[16:40:27] Fixed 2 duplicate /gene - apc3_1 apc3_2
[16:40:27] Fixed 5 duplicate /gene - lrp_1 lrp_2 lrp_3 lrp_4 lrp_5
[16:40:27] Fixed 8 duplicate /gene - uctC_1 uctC_2 uctC_3 uctC_4 uctC_5 uctC_6 uctC_7 uctC_8
[16:40:27] Fixed 2 duplicate /gene - chrR_1 chrR_2
[16:40:27] Fixed 2 duplicate /gene - vraR_1 vraR_2
[16:40:27] Fixed 4 duplicate /gene - oprM_1 oprM_2 oprM_3 oprM_4
[16:40:27] Fixed 2 duplicate /gene - garA_1 garA_2
[16:40:27] Fixed 5 duplicate /gene - crt_1 crt_2 crt_3 crt_4 crt_5
[16:40:27] Fixed 2 duplicate /gene - phoB_1 phoB_2
[16:40:27] Fixed 2 duplicate /gene - lpxH_1 lpxH_2
[16:40:27] Fixed 2 duplicate /gene - icd_1 icd_2
[16:40:27] Fixed 4 duplicate /gene - acsA_1 acsA_2 acsA_3 acsA_4
[16:40:27] Fixed 2 duplicate /gene - aspC_1 aspC_2
[16:40:27] Fixed 2 duplicate /gene - mshD_1 mshD_2
[16:40:27] Fixed 2 duplicate /gene - livJ_1 livJ_2
[16:40:27] Fixed 2 duplicate /gene - hmrR_1 hmrR_2
[16:40:27] Fixed 2 duplicate /gene - dnaK_1 dnaK_2
[16:40:27] Fixed 2 duplicate /gene - btsR_1 btsR_2
[16:40:27] Fixed 2 duplicate /gene - rlmN_1 rlmN_2
[16:40:27] Fixed 3 duplicate /gene - lpxL_1 lpxL_2 lpxL_3
[16:40:27] Fixed 2 duplicate /gene - chrA1_1 chrA1_2
[16:40:27] Fixed 2 duplicate /gene - nrtA_1 nrtA_2
[16:40:27] Fixed 2 duplicate /gene - hmuU_1 hmuU_2
[16:40:27] Fixed 2 duplicate /gene - sttH_1 sttH_2
[16:40:27] Fixed 4 duplicate /gene - ugpC_1 ugpC_2 ugpC_3 ugpC_4
[16:40:27] Fixed 2 duplicate /gene - rsxB_1 rsxB_2
[16:40:27] Fixed 2 duplicate /gene - livK_1 livK_2
[16:40:27] Fixed 2 duplicate /gene - corA_1 corA_2
[16:40:27] Fixed 2 duplicate /gene - hisF_1 hisF_2
[16:40:27] Fixed 3 duplicate /gene - dppC_1 dppC_2 dppC_3
[16:40:27] Fixed 3 duplicate /gene - tolC_1 tolC_2 tolC_3
[16:40:27] Fixed 2 duplicate /gene - pcpR_1 pcpR_2
[16:40:27] Fixed 2 duplicate /gene - ligA_1 ligA_2
[16:40:27] Fixed 2 duplicate /gene - prpC_1 prpC_2
[16:40:27] Fixed 2 duplicate /gene - fbp_1 fbp_2
[16:40:27] Fixed 2 duplicate /gene - metI_1 metI_2
[16:40:27] Fixed 4 duplicate /gene - barA_1 barA_2 barA_3 barA_4
[16:40:27] Fixed 2 duplicate /gene - gpx1_1 gpx1_2
[16:40:27] Fixed 3 duplicate /gene - uvrY_1 uvrY_2 uvrY_3
[16:40:27] Fixed 7 duplicate /gene - ribX_1 ribX_2 ribX_3 ribX_4 ribX_5 ribX_6 ribX_7
[16:40:27] Fixed 4 duplicate /gene - potA_1 potA_2 potA_3 potA_4
[16:40:27] Fixed 5 duplicate /gene - aam_1 aam_2 aam_3 aam_4 aam_5
[16:40:27] Fixed 2 duplicate /gene - fcuA_1 fcuA_2
[16:40:27] Fixed 2 duplicate /gene - thiD_1 thiD_2
[16:40:27] Fixed 3 duplicate /gene - clcD_1 clcD_2 clcD_3
[16:40:27] Fixed 2 duplicate /gene - entS_1 entS_2
[16:40:27] Fixed 3 duplicate /gene - dauA_1 dauA_2 dauA_3
[16:40:27] Fixed 3 duplicate /gene - pspB_1 pspB_2 pspB_3
[16:40:27] Fixed 2 duplicate /gene - fdx_1 fdx_2
[16:40:27] Fixed 2 duplicate /gene - echA8_1 echA8_2
[16:40:27] Fixed 2 duplicate /gene - sugB_1 sugB_2
[16:40:27] Fixed 2 duplicate /gene - mcbR_1 mcbR_2
[16:40:27] Fixed 2 duplicate /gene - glpE_1 glpE_2
[16:40:27] Fixed 2 duplicate /gene - argE_1 argE_2
[16:40:27] Fixed 2 duplicate /gene - yjiE_1 yjiE_2
[16:40:27] Fixed 2 duplicate /gene - efe_1 efe_2
[16:40:27] Fixed 4 duplicate /gene - yisK_1 yisK_2 yisK_3 yisK_4
[16:40:27] Fixed 2 duplicate /gene - fabI_1 fabI_2
[16:40:27] Fixed 3 duplicate /gene - trg_1 trg_2 trg_3
[16:40:27] Fixed 2 duplicate /gene - rimI_1 rimI_2
[16:40:27] Fixed 3 duplicate /gene - dpgD_1 dpgD_2 dpgD_3
[16:40:27] Fixed 2 duplicate /gene - degU_1 degU_2
[16:40:27] Fixed 3 duplicate /gene - takP_1 takP_2 takP_3
[16:40:27] Fixed 3 duplicate /gene - catD_1 catD_2 catD_3
[16:40:27] Fixed 4 duplicate /gene - lutR_1 lutR_2 lutR_3 lutR_4
[16:40:27] Fixed 5 duplicate /gene - qseC_1 qseC_2 qseC_3 qseC_4 qseC_5
[16:40:27] Fixed 2 duplicate /gene - katE_1 katE_2
[16:40:27] Fixed 2 duplicate /gene - scpA_1 scpA_2
[16:40:27] Fixed 2 duplicate /gene - bvgS_1 bvgS_2
[16:40:27] Fixed 4 duplicate /gene - tsr_1 tsr_2 tsr_3 tsr_4
[16:40:27] Fixed 3 duplicate /gene - metQ_1 metQ_2 metQ_3
[16:40:27] Fixed 2 duplicate /gene - leuC_1 leuC_2
[16:40:27] Fixed 2 duplicate /gene - hbpA_1 hbpA_2
[16:40:27] Fixed 2 duplicate /gene - modA_1 modA_2
[16:40:27] Fixed 2 duplicate /gene - cphA_1 cphA_2
[16:40:27] Fixed 2 duplicate /gene - btuB_1 btuB_2
[16:40:27] Fixed 3 duplicate /gene - gltI_1 gltI_2 gltI_3
[16:40:27] Fixed 2 duplicate /gene - occR_1 occR_2
[16:40:27] Fixed 4 duplicate /gene - rhtB_1 rhtB_2 rhtB_3 rhtB_4
[16:40:27] Fixed 2 duplicate /gene - yeaW_1 yeaW_2
[16:40:27] Fixed 3 duplicate /gene - tctD_1 tctD_2 tctD_3
[16:40:27] Fixed 2 duplicate /gene - ftsH_1 ftsH_2
[16:40:27] Fixed 2 duplicate /gene - dmoA_1 dmoA_2
[16:40:27] Fixed 3 duplicate /gene - yiaM_1 yiaM_2 yiaM_3
[16:40:27] Fixed 2 duplicate /gene - tdiR_1 tdiR_2
[16:40:27] Fixed 3 duplicate /gene - cqsS_1 cqsS_2 cqsS_3
[16:40:27] Fixed 3 duplicate /gene - iorB_1 iorB_2 iorB_3
[16:40:27] Fixed 2 duplicate /gene - ydbM_1 ydbM_2
[16:40:27] Fixed 2 duplicate /gene - accA1_1 accA1_2
[16:40:27] Fixed 2 duplicate /gene - hisH_1 hisH_2
[16:40:27] Fixed 6 duplicate /gene - cynR_1 cynR_2 cynR_3 cynR_4 cynR_5 cynR_6
[16:40:27] Fixed 4 duplicate /gene - menH_1 menH_2 menH_3 menH_4
[16:40:27] Fixed 3 duplicate /gene - mshA_1 mshA_2 mshA_3
[16:40:27] Fixed 2 duplicate /gene - ligC_1 ligC_2
[16:40:27] Fixed 3 duplicate /gene - epsE_1 epsE_2 epsE_3
[16:40:27] Fixed 2 duplicate /gene - moeA_1 moeA_2
[16:40:27] Fixed 2 duplicate /gene - nicS_1 nicS_2
[16:40:27] Fixed 4 duplicate /gene - resA_1 resA_2 resA_3 resA_4
[16:40:27] Fixed 2 duplicate /gene - nfdA_1 nfdA_2
[16:40:27] Fixed 9 duplicate /gene - fabG_1 fabG_2 fabG_3 fabG_4 fabG_5 fabG_6 fabG_7 fabG_8 fabG_9
[16:40:27] Fixed 2 duplicate /gene - cysW_1 cysW_2
[16:40:27] Fixed 2 duplicate /gene - kstR2_1 kstR2_2
[16:40:27] Fixed 3 duplicate /gene - ydcV_1 ydcV_2 ydcV_3
[16:40:27] Fixed 2 duplicate /gene - nagK_1 nagK_2
[16:40:27] Fixed 4 duplicate /gene - ribN_1 ribN_2 ribN_3 ribN_4
[16:40:27] Fixed 2 duplicate /gene - cpg2_1 cpg2_2
[16:40:27] Fixed 2 duplicate /gene - der_1 der_2
[16:40:27] Fixed 2 duplicate /gene - qorA_1 qorA_2
[16:40:27] Fixed 5 duplicate /gene - tolQ_1 tolQ_2 tolQ_3 tolQ_4 tolQ_5
[16:40:27] Fixed 2 duplicate /gene - pitA_1 pitA_2
[16:40:27] Fixed 2 duplicate /gene - nnr_1 nnr_2
[16:40:27] Fixed 3 duplicate /gene - plsC_1 plsC_2 plsC_3
[16:40:27] Fixed 2 duplicate /gene - ctaD_1 ctaD_2
[16:40:27] Fixed 2 duplicate /gene - pimB_1 pimB_2
[16:40:27] Fixed 3 duplicate /gene - ycaC_1 ycaC_2 ycaC_3
[16:40:27] Fixed 2 duplicate /gene - desR_1 desR_2
[16:40:27] Fixed 2 duplicate /gene - soj_1 soj_2
[16:40:27] Fixed 3 duplicate /gene - rpfC_1 rpfC_2 rpfC_3
[16:40:27] Fixed 3 duplicate /gene - fadA_1 fadA_2 fadA_3
[16:40:27] Fixed 17 duplicate /gene - livF_1 livF_2 livF_3 livF_4 livF_5 livF_6 livF_7 livF_8 livF_9 livF_10 livF_11 livF_12 livF_13 livF_14 livF_15 livF_16 livF_17
[16:40:27] Fixed 2 duplicate /gene - leuA_1 leuA_2
[16:40:27] Fixed 2 duplicate /gene - yiaO_1 yiaO_2
[16:40:27] Fixed 2 duplicate /gene - rluB_1 rluB_2
[16:40:27] Fixed 2 duplicate /gene - alkA_1 alkA_2
[16:40:27] Fixed 2 duplicate /gene - tauD_1 tauD_2
[16:40:27] Fixed 2 duplicate /gene - cheR_1 cheR_2
[16:40:27] Fixed 2 duplicate /gene - aqpZ_1 aqpZ_2
[16:40:27] Fixed 2 duplicate /gene - ugpB_1 ugpB_2
[16:40:27] Fixed 3 duplicate /gene - luxQ_1 luxQ_2 luxQ_3
[16:40:27] Fixed 2 duplicate /gene - hxcR_1 hxcR_2
[16:40:27] Fixed 3 duplicate /gene - pytH_1 pytH_2 pytH_3
[16:40:27] Fixed 2 duplicate /gene - fdhF_1 fdhF_2
[16:40:27] Fixed 5 duplicate /gene - ssuB_1 ssuB_2 ssuB_3 ssuB_4 ssuB_5
[16:40:27] Fixed 2 duplicate /gene - rutR_1 rutR_2
[16:40:27] Fixed 3 duplicate /gene - slt_1 slt_2 slt_3
[16:40:27] Fixed 3 duplicate /gene - glxR_1 glxR_2 glxR_3
[16:40:27] Fixed 2 duplicate /gene - stp_1 stp_2
[16:40:27] Fixed 2 duplicate /gene - garD_1 garD_2
[16:40:27] Fixed 5 duplicate /gene - hipO_1 hipO_2 hipO_3 hipO_4 hipO_5
[16:40:27] Fixed 3 duplicate /gene - slyA_1 slyA_2 slyA_3
[16:40:27] Fixed 2 duplicate /gene - mtnN_1 mtnN_2
[16:40:27] Fixed 2 duplicate /gene - cysE_1 cysE_2
[16:40:27] Fixed 2 duplicate /gene - nuoF_1 nuoF_2
[16:40:27] Fixed 2 duplicate /gene - ispD_1 ispD_2
[16:40:27] Fixed 3 duplicate /gene - potB_1 potB_2 potB_3
[16:40:27] Fixed 2 duplicate /gene - prmC_1 prmC_2
[16:40:27] Fixed 2 duplicate /gene - guaD_1 guaD_2
[16:40:27] Fixed 2 duplicate /gene - apc4_1 apc4_2
[16:40:27] Fixed 2 duplicate /gene - clsB_1 clsB_2
[16:40:27] Fixed 2 duplicate /gene - yngG_1 yngG_2
[16:40:27] Fixed 2 duplicate /gene - ifcA_1 ifcA_2
[16:40:27] Fixed 5 duplicate /gene - ribY_1 ribY_2 ribY_3 ribY_4 ribY_5
[16:40:27] Fixed 4 duplicate /gene - dadA1_1 dadA1_2 dadA1_3 dadA1_4
[16:40:27] Fixed 2 duplicate /gene - cnbA_1 cnbA_2
[16:40:27] Fixed 2 duplicate /gene - araD_1 araD_2
[16:40:27] Fixed 2 duplicate /gene - dkgB_1 dkgB_2
[16:40:27] Fixed 2 duplicate /gene - todT_1 todT_2
[16:40:27] Fixed 2 duplicate /gene - pbpG_1 pbpG_2
[16:40:27] Fixed 4 duplicate /gene - cloR_1 cloR_2 cloR_3 cloR_4
[16:40:27] Fixed 6 duplicate /gene - braC_1 braC_2 braC_3 braC_4 braC_5 braC_6
[16:40:27] Fixed 2 duplicate /gene - ogt_1 ogt_2
[16:40:27] Fixed 2 duplicate /gene - yadG_1 yadG_2
[16:40:27] Fixed 3 duplicate /gene - btsS_1 btsS_2 btsS_3
[16:40:27] Fixed 3 duplicate /gene - cobD_1 cobD_2 cobD_3
[16:40:27] Fixed 8 duplicate /gene - ssuA_1 ssuA_2 ssuA_3 ssuA_4 ssuA_5 ssuA_6 ssuA_7 ssuA_8
[16:40:27] Fixed 3 duplicate /gene - htpX_1 htpX_2 htpX_3
[16:40:27] Fixed 3 duplicate /gene - siaP_1 siaP_2 siaP_3
[16:40:27] Fixed 3 duplicate /gene - lldR_1 lldR_2 lldR_3
[16:40:27] Fixed 2 duplicate /gene - czcA_1 czcA_2
[16:40:27] Fixed 5 duplicate /gene - yfcG_1 yfcG_2 yfcG_3 yfcG_4 yfcG_5
[16:40:27] Fixed 3 duplicate /gene - phoR_1 phoR_2 phoR_3
[16:40:27] Fixed 2 duplicate /gene - mlaF_1 mlaF_2
[16:40:27] Fixed 2 duplicate /gene - yiaD_1 yiaD_2
[16:40:27] Fixed 3 duplicate /gene - glnM_1 glnM_2 glnM_3
[16:40:27] Fixed 2 duplicate /gene - eno_1 eno_2
[16:40:27] Fixed 2 duplicate /gene - cheW_1 cheW_2
[16:40:27] Fixed 2 duplicate /gene - bepE_1 bepE_2
[16:40:27] Fixed 2 duplicate /gene - osmY_1 osmY_2
[16:40:27] Fixed 2 duplicate /gene - rimO_1 rimO_2
[16:40:27] Fixed 2 duplicate /gene - alaS_1 alaS_2
[16:40:27] Fixed 2 duplicate /gene - phaC_1 phaC_2
[16:40:27] Fixed 3 duplicate /gene - copA_1 copA_2 copA_3
[16:40:27] Fixed 2 duplicate /gene - lemA_1 lemA_2
[16:40:27] Fixed 2 duplicate /gene - pcm_1 pcm_2
[16:40:27] Fixed 2 duplicate /gene - fixJ_1 fixJ_2
[16:40:27] Fixed 6 duplicate /gene - oppD_1 oppD_2 oppD_3 oppD_4 oppD_5 oppD_6
[16:40:27] Fixed 2 duplicate /gene - ydgJ_1 ydgJ_2
[16:40:27] Fixed 2 duplicate /gene - sbp_1 sbp_2
[16:40:27] Fixed 2 duplicate /gene - nreB_1 nreB_2
[16:40:27] Fixed 2 duplicate /gene - tmoT_1 tmoT_2
[16:40:27] Fixed 2 duplicate /gene - curA_1 curA_2
[16:40:27] Fixed 2 duplicate /gene - ribBA_1 ribBA_2
[16:40:27] Fixed 2 duplicate /gene - hyuC_1 hyuC_2
[16:40:27] Fixed 3 duplicate /gene - pilE_1 pilE_2 pilE_3
[16:40:27] Fixed 2 duplicate /gene - yajR_1 yajR_2
[16:40:27] Fixed 17 duplicate /gene - livH_1 livH_2 livH_3 livH_4 livH_5 livH_6 livH_7 livH_8 livH_9 livH_10 livH_11 livH_12 livH_13 livH_14 livH_15 livH_16 livH_17
[16:40:27] Fixed 2 duplicate /gene - dgoD_1 dgoD_2
[16:40:27] Fixed 6 duplicate /gene - mmgC_1 mmgC_2 mmgC_3 mmgC_4 mmgC_5 mmgC_6
[16:40:27] Fixed 2 duplicate /gene - xthA_1 xthA_2
[16:40:27] Fixed 2 duplicate /gene - pgdA_1 pgdA_2
[16:40:27] Fixed 3 duplicate /gene - tauB_1 tauB_2 tauB_3
[16:40:27] Fixed 4 duplicate /gene - fadD_1 fadD_2 fadD_3 fadD_4
[16:40:27] Fixed 2 duplicate /gene - yhaJ_1 yhaJ_2
[16:40:27] Fixed 2 duplicate /gene - mdtC_1 mdtC_2
[16:40:27] Fixed 3 duplicate /gene - ydiK_1 ydiK_2 ydiK_3
[16:40:27] Fixed 2 duplicate /gene - sugA_1 sugA_2
[16:40:27] Fixed 9 duplicate /gene - gsiC_1 gsiC_2 gsiC_3 gsiC_4 gsiC_5 gsiC_6 gsiC_7 gsiC_8 gsiC_9
[16:40:27] Fixed 2 duplicate /gene - frc_1 frc_2
[16:40:27] Fixed 2 duplicate /gene - yhjX_1 yhjX_2
[16:40:27] Fixed 3 duplicate /gene - fabH_1 fabH_2 fabH_3
[16:40:27] Fixed 2 duplicate /gene - mdtA_1 mdtA_2
[16:40:27] Fixed 2 duplicate /gene - dmdB_1 dmdB_2
[16:40:27] Fixed 2 duplicate /gene - mepA_1 mepA_2
[16:40:27] Fixed 2 duplicate /gene - limB_1 limB_2
[16:40:27] Fixed 2 duplicate /gene - metN_1 metN_2
[16:40:27] Fixed 2 duplicate /gene - cyaA_1 cyaA_2
[16:40:27] Fixed 2 duplicate /gene - atpH_1 atpH_2
[16:40:27] Fixed 3 duplicate /gene - dppA_1 dppA_2 dppA_3
[16:40:27] Fixed 3 duplicate /gene - rbsA_1 rbsA_2 rbsA_3
[16:40:27] Fixed 13 duplicate /gene - lptB_1 lptB_2 lptB_3 lptB_4 lptB_5 lptB_6 lptB_7 lptB_8 lptB_9 lptB_10 lptB_11 lptB_12 lptB_13
[16:40:27] Fixed 4 duplicate /gene - yabJ_1 yabJ_2 yabJ_3 yabJ_4
[16:40:27] Fixed 2 duplicate /gene - amiC_1 amiC_2
[16:40:27] Fixed 2 duplicate /gene - ybaK_1 ybaK_2
[16:40:27] Fixed 2 duplicate /gene - guaB_1 guaB_2
[16:40:27] Fixed 2 duplicate /gene - nagC_1 nagC_2
[16:40:27] Fixed 2 duplicate /gene - gstA_1 gstA_2
[16:40:27] Fixed 2 duplicate /gene - ohrR_1 ohrR_2
[16:40:27] Fixed 5 duplicate /gene - gsiB_1 gsiB_2 gsiB_3 gsiB_4 gsiB_5
[16:40:27] Fixed 3 duplicate /gene - lysN_1 lysN_2 lysN_3
[16:40:27] Fixed 3 duplicate /gene - proA_1 proA_2 proA_3
[16:40:27] Fixed 2 duplicate /gene - ycf3_1 ycf3_2
[16:40:27] Fixed 7 duplicate /gene - glnQ_1 glnQ_2 glnQ_3 glnQ_4 glnQ_5 glnQ_6 glnQ_7
[16:40:27] Fixed 2 duplicate /gene - kdgR_1 kdgR_2
[16:40:27] Fixed 2 duplicate /gene - ppsC_1 ppsC_2
[16:40:27] Fixed 2 duplicate /gene - thadh_1 thadh_2
[16:40:27] Fixed 3 duplicate /gene - pgrR_1 pgrR_2 pgrR_3
[16:40:27] Fixed 2 duplicate /gene - arsC_1 arsC_2
[16:40:27] Fixed 3 duplicate /gene - fliY_1 fliY_2 fliY_3
[16:40:27] Fixed 2 duplicate /gene - rluA_1 rluA_2
[16:40:27] Fixed 2 duplicate /gene - tsaC1_1 tsaC1_2
[16:40:27] Fixed 2 duplicate /gene - dgcC_1 dgcC_2
[16:40:27] Fixed 2 duplicate /gene - comM_1 comM_2
[16:40:27] Fixed 3 duplicate /gene - gno_1 gno_2 gno_3
[16:40:27] Fixed 3 duplicate /gene - dppD_1 dppD_2 dppD_3
[16:40:27] Fixed 2 duplicate /gene - zntR_1 zntR_2
[16:40:27] Fixed 3 duplicate /gene - lcfB_1 lcfB_2 lcfB_3
[16:40:27] Fixed 2 duplicate /gene - ddpC_1 ddpC_2
[16:40:27] Fixed 3 duplicate /gene - cmpB_1 cmpB_2 cmpB_3
[16:40:27] Fixed 2 duplicate /gene - mlaE_1 mlaE_2
[16:40:27] Fixed 4 duplicate /gene - dapE_1 dapE_2 dapE_3 dapE_4
[16:40:27] Fixed 2 duplicate /gene - estA_1 estA_2
[16:40:27] Fixed 2 duplicate /gene - prsD_1 prsD_2
[16:40:27] Fixed 6 duplicate /gene - fixL_1 fixL_2 fixL_3 fixL_4 fixL_5 fixL_6
[16:40:27] Fixed 3 duplicate /gene - cpo_1 cpo_2 cpo_3
[16:40:27] Fixed 2 duplicate /gene - bclA_1 bclA_2
[16:40:27] Fixed 3 duplicate /gene - dan_1 dan_2 dan_3
[16:40:27] Fixed 2 duplicate /gene - nasD_1 nasD_2
[16:40:27] Fixed 4 duplicate /gene - gntR_1 gntR_2 gntR_3 gntR_4
[16:40:27] Fixed 2 duplicate /gene - gltA_1 gltA_2
[16:40:27] Fixed 3 duplicate /gene - cadA_1 cadA_2 cadA_3
[16:40:27] Fixed 2 duplicate /gene - xerC_1 xerC_2
[16:40:27] Fixed 4 duplicate /gene - gcvA_1 gcvA_2 gcvA_3 gcvA_4
[16:40:27] Fixed 3 duplicate /gene - dmdC_1 dmdC_2 dmdC_3
[16:40:27] Fixed 2 duplicate /gene - ftsA_1 ftsA_2
[16:40:27] Fixed 3 duplicate /gene - cheB_1 cheB_2 cheB_3
[16:40:27] Fixed 2 duplicate /gene - amtB_1 amtB_2
[16:40:27] Fixed 2 duplicate /gene - msrAB_1 msrAB_2
[16:40:27] Fixed 3 duplicate /gene - sspA_1 sspA_2 sspA_3
[16:40:27] Fixed 5 duplicate /gene - oppF_1 oppF_2 oppF_3 oppF_4 oppF_5
[16:40:27] Fixed 3 duplicate /gene - cc4_1 cc4_2 cc4_3
[16:40:27] Fixed 12 duplicate /gene - dmlR_1 dmlR_2 dmlR_3 dmlR_4 dmlR_5 dmlR_6 dmlR_7 dmlR_8 dmlR_9 dmlR_10 dmlR_11 dmlR_12
[16:40:27] Fixed 2 duplicate /gene - speE_1 speE_2
[16:40:27] Fixed 2 duplicate /gene - glrR_1 glrR_2
[16:40:27] Fixed 2 duplicate /gene - lolA_1 lolA_2
[16:40:27] Fixed 2 duplicate /gene - algE5_1 algE5_2
[16:40:27] Fixed 2 duplicate /gene - infA_1 infA_2
[16:40:27] Fixed 3 duplicate /gene - soxA_1 soxA_2 soxA_3
[16:40:27] Fixed 3 duplicate /gene - crnA_1 crnA_2 crnA_3
[16:40:27] Fixed 4 duplicate /gene - cya_1 cya_2 cya_3 cya_4
[16:40:27] Fixed 4 duplicate /gene - siaM_1 siaM_2 siaM_3 siaM_4
[16:40:27] Fixed 2 duplicate /gene - gloB_1 gloB_2
[16:40:27] Fixed 2 duplicate /gene - acdA_1 acdA_2
[16:40:27] Fixed 2 duplicate /gene - qseB_1 qseB_2
[16:40:27] Fixed 3 duplicate /gene - ddpF_1 ddpF_2 ddpF_3
[16:40:27] Fixed 2 duplicate /gene - yphB_1 yphB_2
[16:40:27] Fixed 2 duplicate /gene - siaQ_1 siaQ_2
[16:40:27] Fixed 5 duplicate /gene - tar_1 tar_2 tar_3 tar_4 tar_5
[16:40:27] Fixed 2 duplicate /gene - metH_1 metH_2
[16:40:27] Fixed 4 duplicate /gene - clsC_1 clsC_2 clsC_3 clsC_4
[16:40:27] Fixed 3 duplicate /gene - iorA_1 iorA_2 iorA_3
[16:40:27] Fixed 2 duplicate /gene - glpQ_1 glpQ_2
[16:40:27] Fixed 2 duplicate /gene - mexR_1 mexR_2
[16:40:27] Fixed 3 duplicate /gene - xerD_1 xerD_2 xerD_3
[16:40:27] Fixed 2 duplicate /gene - btuF_1 btuF_2
[16:40:27] Fixed 4 duplicate /gene - cheY_1 cheY_2 cheY_3 cheY_4
[16:40:28] Fixed 2 duplicate /gene - gstB_1 gstB_2
[16:40:28] Fixed 2 duplicate /gene - pyrE_1 pyrE_2
[16:40:28] Fixed 2 duplicate /gene - ndmA_1 ndmA_2
[16:40:28] Fixed 2 duplicate /gene - sdpR_1 sdpR_2
[16:40:28] Fixed 2 duplicate /gene - slyD_1 slyD_2
[16:40:28] Fixed 5 duplicate /gene - cyaB_1 cyaB_2 cyaB_3 cyaB_4 cyaB_5
[16:40:28] Fixed 2 duplicate /gene - aroK_1 aroK_2
[16:40:28] Fixed 2 duplicate /gene - ypdB_1 ypdB_2
[16:40:28] Fixed 3 duplicate /gene - hxuA_1 hxuA_2 hxuA_3
[16:40:28] Fixed 2 duplicate /gene - evgS_1 evgS_2
[16:40:28] Fixed 2 duplicate /gene - ccpA_1 ccpA_2
[16:40:28] Fixed 3 duplicate /gene - rclR_1 rclR_2 rclR_3
[16:40:28] Fixed 2 duplicate /gene - iclR_1 iclR_2
[16:40:28] Fixed 3 duplicate /gene - ais_1 ais_2 ais_3
[16:40:28] Fixed 2 duplicate /gene - czcB_1 czcB_2
[16:40:28] Fixed 2 duplicate /gene - zraR_1 zraR_2
[16:40:28] Fixed 2 duplicate /gene - mdtB_1 mdtB_2
[16:40:28] Fixed 384 colliding /gene names.
[16:40:28] Adding /locus_tag identifiers
[16:40:28] Assigned 4339 locus_tags to CDS and RNA features.
[16:40:28] Writing outputs to data/annotation/bin1/
[16:40:31] Generating annotation statistics file
[16:40:31] Generating Genbank and Sequin files
[16:40:31] Running: tbl2asn -V b -a r10k -l paired-ends -M n -N 1 -y 'Annotated using prokka 1.13 from https://github.com/tseemann/prokka' -Z data\/annotation\/bin1\/PROKKA_03222022\.err -i data\/annotation\/bin1\/PROKKA_03222022\.fsa 2> /dev/null
[16:40:40] Deleting unwanted file: data/annotation/bin1/errorsummary.val
[16:40:40] Deleting unwanted file: data/annotation/bin1/PROKKA_03222022.dr
[16:40:40] Deleting unwanted file: data/annotation/bin1/PROKKA_03222022.fixedproducts
[16:40:40] Deleting unwanted file: data/annotation/bin1/PROKKA_03222022.ecn
[16:40:40] Deleting unwanted file: data/annotation/bin1/PROKKA_03222022.val
[16:40:40] Repairing broken .GBK output that tbl2asn produces...
[16:40:40] Running: sed 's/COORDINATES: profile/COORDINATES:profile/' < data\/annotation\/bin1\/PROKKA_03222022\.gbf > data\/annotation\/bin1\/PROKKA_03222022\.gbk
[16:40:40] Deleting unwanted file: data/annotation/bin1/PROKKA_03222022.gbf
[16:40:40] Output files:
[16:40:40] data/annotation/bin1/PROKKA_03222022.log
[16:40:40] data/annotation/bin1/PROKKA_03222022.fna
[16:40:40] data/annotation/bin1/PROKKA_03222022.gff
[16:40:40] data/annotation/bin1/PROKKA_03222022.faa
[16:40:40] data/annotation/bin1/PROKKA_03222022.ffn
[16:40:40] data/annotation/bin1/PROKKA_03222022.tbl
[16:40:40] data/annotation/bin1/PROKKA_03222022.fsa
[16:40:40] data/annotation/bin1/PROKKA_03222022.tsv
[16:40:40] data/annotation/bin1/PROKKA_03222022.txt
[16:40:40] data/annotation/bin1/PROKKA_03222022.sqn
[16:40:40] data/annotation/bin1/PROKKA_03222022.err
[16:40:40] data/annotation/bin1/PROKKA_03222022.gbk
[16:40:40] Annotation finished successfully.
[16:40:40] Walltime used: 6.00 minutes
[16:40:40] If you use this result please cite the Prokka paper:
[16:40:40] Seemann T (2014) Prokka: rapid prokaryotic genome annotation. Bioinformatics. 30(14):2068-9.
[16:40:40] Type 'prokka --citation' for more details.
[16:40:40] Thank you, come again.
(metagenomics_practical) 

Investigating prokka output

The prokka output is very elaborate and can be used to many ends. We will quickly visualise the output for the purpose of this practical. To investigate the prokka output, you can use two webservers that both can place the annotations from prokka in metabolic KEGG pathways. First, we'll inspect the contents of a GFF file.

[DO:] View prokka output: (look for the .gff file)

In [7]:
ls data/annotation/bin1
PROKKA_03222022.err  PROKKA_03222022.fsa  PROKKA_03222022.sqn
PROKKA_03222022.faa  PROKKA_03222022.gbk  PROKKA_03222022.tbl
PROKKA_03222022.ffn  PROKKA_03222022.gff  PROKKA_03222022.tsv
PROKKA_03222022.fna  PROKKA_03222022.log  PROKKA_03222022.txt
(metagenomics_practical) 

In [8]:
head data/annotation/bin1/PROKKA_03222022.txt
organism: Genus species strain 
contigs: 37
bases: 4425020
gene: 4339
tRNA: 44
CDS: 4294
tmRNA: 1
(metagenomics_practical) 

In [1]:
grep -v '#' data/annotation/bin1/PROKKA_03222022.gff | head
gnl|X|HFHIMAGJ_1	Prodigal:2.6	CDS	394	1284	.	-	0	ID=HFHIMAGJ_00001;Parent=HFHIMAGJ_00001_gene;inference=ab initio prediction:Prodigal:2.6;locus_tag=HFHIMAGJ_00001;product=hypothetical protein;protein_id=gnl|X|HFHIMAGJ_00001
gnl|X|HFHIMAGJ_1	prokka	gene	394	1284	.	-	.	ID=HFHIMAGJ_00001_gene;locus_tag=HFHIMAGJ_00001
gnl|X|HFHIMAGJ_1	Prodigal:2.6	CDS	1291	1719	.	-	0	ID=HFHIMAGJ_00002;Parent=HFHIMAGJ_00002_gene;Name=exbD_1;dbxref=COG:COG0848;gene=exbD_1;inference=ab initio prediction:Prodigal:2.6,similar to AA sequence:UniProtKB:P0ABV2;locus_tag=HFHIMAGJ_00002;product=Biopolymer transport protein ExbD;protein_id=gnl|X|HFHIMAGJ_00002
gnl|X|HFHIMAGJ_1	prokka	gene	1291	1719	.	-	.	ID=HFHIMAGJ_00002_gene;Name=exbD_1;gene=exbD_1;locus_tag=HFHIMAGJ_00002
gnl|X|HFHIMAGJ_1	Prodigal:2.6	CDS	1721	2413	.	-	0	ID=HFHIMAGJ_00003;Parent=HFHIMAGJ_00003_gene;Name=tolQ_1;dbxref=COG:COG0811;gene=tolQ_1;inference=ab initio prediction:Prodigal:2.6,similar to AA sequence:UniProtKB:P0ABU9;locus_tag=HFHIMAGJ_00003;product=Protein TolQ;protein_id=gnl|X|HFHIMAGJ_00003
gnl|X|HFHIMAGJ_1	prokka	gene	1721	2413	.	-	.	ID=HFHIMAGJ_00003_gene;Name=tolQ_1;gene=tolQ_1;locus_tag=HFHIMAGJ_00003
gnl|X|HFHIMAGJ_1	Prodigal:2.6	CDS	2410	2811	.	-	0	ID=HFHIMAGJ_00004;Parent=HFHIMAGJ_00004_gene;eC_number=3.1.2.-;Name=ybgC;dbxref=COG:COG0824;gene=ybgC;inference=ab initio prediction:Prodigal:2.6,similar to AA sequence:UniProtKB:P44679;locus_tag=HFHIMAGJ_00004;product=Acyl-CoA thioesterase YbgC;protein_id=gnl|X|HFHIMAGJ_00004
gnl|X|HFHIMAGJ_1	prokka	gene	2410	2811	.	-	.	ID=HFHIMAGJ_00004_gene;Name=ybgC;gene=ybgC;locus_tag=HFHIMAGJ_00004
gnl|X|HFHIMAGJ_1	Prodigal:2.6	CDS	2808	4001	.	-	0	ID=HFHIMAGJ_00005;Parent=HFHIMAGJ_00005_gene;eC_number=2.6.1.-;Name=patA;dbxref=COG:COG0436;gene=patA;inference=ab initio prediction:Prodigal:2.6,similar to AA sequence:UniProtKB:P16524;locus_tag=HFHIMAGJ_00005;product=Putative N-acetyl-LL-diaminopimelate aminotransferase;protein_id=gnl|X|HFHIMAGJ_00005
gnl|X|HFHIMAGJ_1	prokka	gene	2808	4001	.	-	.	ID=HFHIMAGJ_00005_gene;Name=patA;gene=patA;locus_tag=HFHIMAGJ_00005
grep: write error: Broken pipe

Visualisation in IPath3 onpathways.embl.de

For visualisation in pathways.embl.de we need to add the word 'UNIRPOT:' to the list of IDs like so:

Prokka gives UniProt IDs in the gff files first we will collect these IDs and discard all other information. However, we need to make sure that IPath3 understands these are UniprotIDs. The code below helps you to do so.

[DO:] take the uniprot IDs out of the gff file:

In [4]:
grep -o 'UniProt.*' data/annotation/bin1/PROKKA_03222022.gff| cut -d';' -f1 | cut -d':' -f2 | sed 's/^/UNIPROT:/g'
UNIPROT:P0ABV2
UNIPROT:P0ABU9
UNIPROT:P44679
UNIPROT:P16524
UNIPROT:P0A780
UNIPROT:P11998
UNIPROT:A5U2B7
UNIPROT:P51961
UNIPROT:P25539
UNIPROT:P02974
UNIPROT:S0HPF7
UNIPROT:P0A8D0
UNIPROT:P0A2E2
UNIPROT:P0AGC3
UNIPROT:P76346
UNIPROT:Q9I2V0
UNIPROT:P83221
UNIPROT:P23869
UNIPROT:P21888
UNIPROT:P37878
UNIPROT:Q9HXZ2
UNIPROT:P52097
UNIPROT:C3JXY0
UNIPROT:Q9KR66
UNIPROT:P0ACG8
UNIPROT:P45336
UNIPROT:P0CL17
UNIPROT:B3EWP1
UNIPROT:P14375
UNIPROT:P21175
UNIPROT:P45073
UNIPROT:P22731
UNIPROT:P0AEX7
UNIPROT:P25026
UNIPROT:P23871
UNIPROT:P0ADP0
UNIPROT:P37269
UNIPROT:Q6N3F2
UNIPROT:P76397
UNIPROT:P13511
UNIPROT:Q9JZ37
UNIPROT:Q9JZ38
UNIPROT:P0A6H1
UNIPROT:P0A9M0
UNIPROT:O05412
UNIPROT:P14407
UNIPROT:P33950
UNIPROT:P75798
UNIPROT:P75799
UNIPROT:P24137
UNIPROT:P24136
UNIPROT:Q6DTN4
UNIPROT:P05042
UNIPROT:P13738
UNIPROT:P27550
UNIPROT:P15452
UNIPROT:P9WKJ5
UNIPROT:Q47141
UNIPROT:P60955
UNIPROT:P0ACM2
UNIPROT:B3EWP1
UNIPROT:O07610
UNIPROT:Q9R381
UNIPROT:Q57252
UNIPROT:P0ACM2
UNIPROT:P55819
UNIPROT:B3EWP1
UNIPROT:P76518
UNIPROT:P52046
UNIPROT:Q9ZHQ1
UNIPROT:P75797
UNIPROT:P75798
UNIPROT:P26904
UNIPROT:P24136
UNIPROT:P24137
UNIPROT:P45493
UNIPROT:P07658
UNIPROT:A8GQT6
UNIPROT:P69902
UNIPROT:P23222
UNIPROT:P69902
UNIPROT:P40149
UNIPROT:B3EWP1
UNIPROT:O06724
UNIPROT:Q55459
UNIPROT:P23882
UNIPROT:P23878
UNIPROT:Q56992
UNIPROT:O32102
UNIPROT:P40883
UNIPROT:P17315
UNIPROT:Q9HTW3
UNIPROT:P06129
UNIPROT:P29931
UNIPROT:P37028
UNIPROT:Q56992
UNIPROT:P15031
UNIPROT:P31570
UNIPROT:P9WP93
UNIPROT:P97084
UNIPROT:P9WP95
UNIPROT:Q7SIC7
UNIPROT:Q7A551
UNIPROT:Q9KJX5
UNIPROT:P76397
UNIPROT:B7M458
UNIPROT:A7ZNP9
UNIPROT:Q51487
UNIPROT:A9BJC1
UNIPROT:Q08IS0
UNIPROT:P15452
UNIPROT:O08498
UNIPROT:O53838
UNIPROT:P33015
UNIPROT:P0AB83
UNIPROT:Q9F1R2
UNIPROT:B3EWP1
UNIPROT:O05218
UNIPROT:Q9F466
UNIPROT:P76114
UNIPROT:P0AFR9
UNIPROT:P0AFK4
UNIPROT:P0AFK9
UNIPROT:P69874
UNIPROT:P9WP07
UNIPROT:A0A0K8P8E7
UNIPROT:P42419
UNIPROT:P80064
UNIPROT:P0A6D5
UNIPROT:P37676
UNIPROT:Q9KR66
UNIPROT:P37674
UNIPROT:Q88FX7
UNIPROT:Q72LL6
UNIPROT:P23836
UNIPROT:D0ZV89
UNIPROT:P77526
UNIPROT:O32242
UNIPROT:P21161
UNIPROT:P52003
UNIPROT:Q46871
UNIPROT:O34360
UNIPROT:P16575
UNIPROT:P71040
UNIPROT:P9WNH5
UNIPROT:B3EWP1
UNIPROT:P0A0H9
UNIPROT:A0A0A1H8I4
UNIPROT:Q07806
UNIPROT:O34816
UNIPROT:O32130
UNIPROT:P25888
UNIPROT:P46336
UNIPROT:P08245
UNIPROT:P10955
UNIPROT:P14930
UNIPROT:P0A115
UNIPROT:O34349
UNIPROT:A0R3F9
UNIPROT:Q6STM1
UNIPROT:A0QYG2
UNIPROT:P0AGG4
UNIPROT:P37662
UNIPROT:P0A9F6
UNIPROT:P42251
UNIPROT:P23545
UNIPROT:Q12A06
UNIPROT:P30958
UNIPROT:Q2SWT6
UNIPROT:A3NWD9
UNIPROT:P0AEJ4
UNIPROT:P0AA16
UNIPROT:O31502
UNIPROT:O86034
UNIPROT:P0AG20
UNIPROT:P9WIY9
UNIPROT:A5W4E2
UNIPROT:A5W4E3
UNIPROT:P23222
UNIPROT:Q8KR08
UNIPROT:A5W4E2
UNIPROT:P0AEI4
UNIPROT:P21499
UNIPROT:Q7A3L9
UNIPROT:P31806
UNIPROT:Q9HTQ0
UNIPROT:Q38M35
UNIPROT:P66540
UNIPROT:P0A6P1
UNIPROT:P65932
UNIPROT:Q8GRF5
UNIPROT:P60473
UNIPROT:P0ABG1
UNIPROT:P45568
UNIPROT:P0AEH1
UNIPROT:Q9K1H0
UNIPROT:P31519
UNIPROT:Q9HXY6
UNIPROT:Q2SWY7
UNIPROT:Q2SWY6
UNIPROT:P10441
UNIPROT:P10442
UNIPROT:Q7A794
UNIPROT:P0A6F1
UNIPROT:P00968
UNIPROT:P0A6W5
UNIPROT:P71376
UNIPROT:P0C0R7
UNIPROT:P0AAI3
UNIPROT:P0AC13
UNIPROT:P31120
UNIPROT:P02941
UNIPROT:P9WMY5
UNIPROT:P0DP44
UNIPROT:P76502
UNIPROT:P0AER8
UNIPROT:P64423
UNIPROT:Q9I154
UNIPROT:Q9S605
UNIPROT:P0A669
UNIPROT:P0AG82
UNIPROT:P0AGH8
UNIPROT:P07654
UNIPROT:P0AAH0
UNIPROT:Q51547
UNIPROT:P0AFJ5
UNIPROT:P08400
UNIPROT:O82855
UNIPROT:Q2FWD8
UNIPROT:P77376
UNIPROT:P0AG30
UNIPROT:P0AA27
UNIPROT:A2RH77
UNIPROT:P0A8B5
UNIPROT:Q8RDI4
UNIPROT:Q47538
UNIPROT:A9WGD1
UNIPROT:P33015
UNIPROT:P0AEZ7
UNIPROT:P0AC84
UNIPROT:P0A7Y4
UNIPROT:P37636
UNIPROT:Q8G2M6
UNIPROT:P94562
UNIPROT:P26993
UNIPROT:Q84BZ3
UNIPROT:Q84BZ2
UNIPROT:Q84BZ1
UNIPROT:P0AEX7
UNIPROT:P45073
UNIPROT:P22731
UNIPROT:P12281
UNIPROT:P32173
UNIPROT:P69848
UNIPROT:P94681
UNIPROT:P23883
UNIPROT:P23222
UNIPROT:P94548
UNIPROT:O68460
UNIPROT:P0A7A9
UNIPROT:Q03638
UNIPROT:P0A9Z1
UNIPROT:O06986
UNIPROT:P0ABN1
UNIPROT:O05404
UNIPROT:P00893
UNIPROT:P00894
UNIPROT:C1DFH7
UNIPROT:P9WQB3
UNIPROT:Q9JZG1
UNIPROT:P0AFI5
UNIPROT:P37671
UNIPROT:Q983E4
UNIPROT:P0A8P1
UNIPROT:P39593
UNIPROT:P71350
UNIPROT:P55882
UNIPROT:Q5SL28
UNIPROT:P0A123
UNIPROT:P23845
UNIPROT:P9WIK1
UNIPROT:P17854
UNIPROT:P9WJ03
UNIPROT:P77182
UNIPROT:Q3JQJ0
UNIPROT:P0AG78
UNIPROT:P80644
UNIPROT:P75853
UNIPROT:P75853
UNIPROT:O85764
UNIPROT:P75851
UNIPROT:P75853
UNIPROT:Q8KZQ6
UNIPROT:P08854
UNIPROT:P75853
UNIPROT:P75853
UNIPROT:P75853
UNIPROT:P54998
UNIPROT:P40854
UNIPROT:P30750
UNIPROT:P31547
UNIPROT:P28635
UNIPROT:P07062
UNIPROT:P0A9G2
UNIPROT:P44558
UNIPROT:P97027
UNIPROT:P40401
UNIPROT:P40400
UNIPROT:P54998
UNIPROT:P10907
UNIPROT:P9WG01
UNIPROT:P9WG03
UNIPROT:A0QXD9
UNIPROT:Q9WWU5
UNIPROT:P37610
UNIPROT:Q55463
UNIPROT:P75851
UNIPROT:P14532
UNIPROT:O52866
UNIPROT:Q7A3L9
UNIPROT:O34723
UNIPROT:P38055
UNIPROT:P9WMG1
UNIPROT:P9WMG1
UNIPROT:Q6DTN4
UNIPROT:P22862
UNIPROT:Q56232
UNIPROT:A9WGD1
UNIPROT:A9WGD1
UNIPROT:P97027
UNIPROT:P39644
UNIPROT:P0ACM2
UNIPROT:P10088
UNIPROT:Q8GHB1
UNIPROT:A9WGD2
UNIPROT:O86041
UNIPROT:P42328
UNIPROT:Q9KKU0
UNIPROT:D3KVM6
UNIPROT:Q88IY3
UNIPROT:P76250
UNIPROT:P22731
UNIPROT:P0A9V1
UNIPROT:P0AEX7
UNIPROT:P0AG34
UNIPROT:Q8YEE8
UNIPROT:Q8GQN9
UNIPROT:Q9HYH1
UNIPROT:P0A6D7
UNIPROT:Q84HH6
UNIPROT:Q9AIX7
UNIPROT:P05102
UNIPROT:Q55168
UNIPROT:P77526
UNIPROT:Q9KJE8
UNIPROT:P9WMH2
UNIPROT:P77467
UNIPROT:P76083
UNIPROT:Q0GQS6
UNIPROT:Q9X1C1
UNIPROT:P0A7C2
UNIPROT:Q9ZLB9
UNIPROT:J7RC67
UNIPROT:Q9HZM5
UNIPROT:A1IQS5
UNIPROT:P27300
UNIPROT:P0ABV2
UNIPROT:P0ABU9
UNIPROT:P04994
UNIPROT:D3KVM5
UNIPROT:Q02NB5
UNIPROT:P16100
UNIPROT:Q9S1B7
UNIPROT:Q9A5I0
UNIPROT:P0ABH9
UNIPROT:Q8GR70
UNIPROT:Q9F0P1
UNIPROT:Q0K020
UNIPROT:P0A9J4
UNIPROT:B3EWP1
UNIPROT:P77397
UNIPROT:O06724
UNIPROT:B3EWP1
UNIPROT:P52679
UNIPROT:P37251
UNIPROT:B3EWP1
UNIPROT:P40408
UNIPROT:Q7A3L9
UNIPROT:P77333
UNIPROT:O67517
UNIPROT:B3EWP1
UNIPROT:P61167
UNIPROT:P37597
UNIPROT:B3EWP1
UNIPROT:A1B497
UNIPROT:P42435
UNIPROT:P0A9I8
UNIPROT:P0AC25
UNIPROT:Q04J43
UNIPROT:P08337
UNIPROT:P21499
UNIPROT:Q83AY0
UNIPROT:B3EWP1
UNIPROT:P0AFR2
UNIPROT:P25524
UNIPROT:P0AAG8
UNIPROT:Q99TQ0
UNIPROT:Q2YKI6
UNIPROT:P67396
UNIPROT:A5U2B7
UNIPROT:A4F2N8
UNIPROT:Q51423
UNIPROT:P15640
UNIPROT:P36553
UNIPROT:P54455
UNIPROT:Q7P0P8
UNIPROT:P0A8I8
UNIPROT:P25536
UNIPROT:P0A9J0
UNIPROT:Q9R9D5
UNIPROT:Q9KQW9
UNIPROT:Q7WTB1
UNIPROT:H2K893
UNIPROT:Q83F59
UNIPROT:P0A912
UNIPROT:P45955
UNIPROT:Q46927
UNIPROT:P63389
UNIPROT:Q5F973
UNIPROT:P39199
UNIPROT:Q9JYL2
UNIPROT:P24559
UNIPROT:Q5DL43
UNIPROT:A0LEA5
UNIPROT:Q9I310
UNIPROT:P51834
UNIPROT:P15042
UNIPROT:P0AAR5
UNIPROT:P0ACV0
UNIPROT:Q93LE9
UNIPROT:B3H2N1
UNIPROT:P36223
UNIPROT:Q9ZCD3
UNIPROT:P15254
UNIPROT:J7QLC0
UNIPROT:P77526
UNIPROT:P36767
UNIPROT:P75919
UNIPROT:P0AEJ0
UNIPROT:P27303
UNIPROT:Q9KWV3
UNIPROT:P69989
UNIPROT:Q7A4T3
UNIPROT:P54570
UNIPROT:A1B479
UNIPROT:P0AFE8
UNIPROT:P9WIW1
UNIPROT:P29923
UNIPROT:P0AFE0
UNIPROT:A1B486
UNIPROT:A1B487
UNIPROT:P29915
UNIPROT:A8GQT6
UNIPROT:Q56221
UNIPROT:A1B495
UNIPROT:P29917
UNIPROT:A1B497
UNIPROT:P19045
UNIPROT:P0AG99
UNIPROT:B1XB85
UNIPROT:Q7TXL8
UNIPROT:Q9K062
UNIPROT:P0ADZ4
UNIPROT:Q9X0Y2
UNIPROT:P04816
UNIPROT:P40731
UNIPROT:Q48468
UNIPROT:P73452
UNIPROT:Q55461
UNIPROT:Q55463
UNIPROT:P9WGM3
UNIPROT:P42435
UNIPROT:P42436
UNIPROT:P39458
UNIPROT:P30177
UNIPROT:O34744
UNIPROT:P44941
UNIPROT:P66519
UNIPROT:P54464
UNIPROT:P0AFK0
UNIPROT:Q87WS9
UNIPROT:P0AF03
UNIPROT:P67765
UNIPROT:P0AE01
UNIPROT:P0ADG4
UNIPROT:P39285
UNIPROT:G2JZ44
UNIPROT:O53732
UNIPROT:Q3IYV6
UNIPROT:P40685
UNIPROT:P0A9H7
UNIPROT:P0A1Y0
UNIPROT:P60844
UNIPROT:P60932
UNIPROT:P0AD14
UNIPROT:P0AFT5
UNIPROT:P16575
UNIPROT:Q9I2U8
UNIPROT:K9NBS6
UNIPROT:Q7CXI0
UNIPROT:P15321
UNIPROT:P44602
UNIPROT:P07061
UNIPROT:P30897
UNIPROT:P24734
UNIPROT:P67153
UNIPROT:P30871
UNIPROT:O31743
UNIPROT:P0AFN6
UNIPROT:P36879
UNIPROT:Q55168
UNIPROT:P38097
UNIPROT:P05704
UNIPROT:P9WLZ7
UNIPROT:Q7A7N0
UNIPROT:P96825
UNIPROT:Q55168
UNIPROT:Q9ZBU1
UNIPROT:P0A6W9
UNIPROT:Q2YPC1
UNIPROT:Q53547
UNIPROT:P0A9F6
UNIPROT:B3EWP1
UNIPROT:P72349
UNIPROT:Q9HTQ0
UNIPROT:Q8PHA1
UNIPROT:Q93KD6
UNIPROT:P10346
UNIPROT:Q93KD5
UNIPROT:G7CFI3
UNIPROT:O07838
UNIPROT:P9WFJ7
UNIPROT:P46336
UNIPROT:Q3J1R2
UNIPROT:Q3J1R2
UNIPROT:Q3J1R2
UNIPROT:Q9KGU7
UNIPROT:Q08291
UNIPROT:Q7VV85
UNIPROT:P0ABR7
UNIPROT:P9WHF5
UNIPROT:Q3S8R0
UNIPROT:P9WLC7
UNIPROT:Q05918
UNIPROT:Q5E7H1
UNIPROT:P0A8H3
UNIPROT:P0A115
UNIPROT:P70994
UNIPROT:P00582
UNIPROT:Q8UHA8
UNIPROT:Q6XL52
UNIPROT:P03018
UNIPROT:O05269
UNIPROT:P9WK87
UNIPROT:Q2TDY4
UNIPROT:P28246
UNIPROT:P25888
UNIPROT:P0A784
UNIPROT:Q02I31
UNIPROT:Q8ZKB0
UNIPROT:P9WP93
UNIPROT:Q9K0K5
UNIPROT:B2JF31
UNIPROT:Q6F7I0
UNIPROT:P0A7T3
UNIPROT:Q9HUU7
UNIPROT:P46854
UNIPROT:P0AFJ7
UNIPROT:P0AA37
UNIPROT:O33950
UNIPROT:Q8RJB2
UNIPROT:P64581
UNIPROT:A0R5M8
UNIPROT:G7CFI3
UNIPROT:O34812
UNIPROT:P9WLZ3
UNIPROT:Q56837
UNIPROT:Q7A6J4
UNIPROT:Q98GN8
UNIPROT:Q8ZL58
UNIPROT:P0DOA0
UNIPROT:Q79PF4
UNIPROT:P69451
UNIPROT:B3EWP1
UNIPROT:K9NBS6
UNIPROT:P77608
UNIPROT:P0ACM2
UNIPROT:P0AEM9
UNIPROT:O34671
UNIPROT:P0AFT2
UNIPROT:O34677
UNIPROT:P37965
UNIPROT:Q3S4B7
UNIPROT:P09339
UNIPROT:Q9HWR3
UNIPROT:P0A9T0
UNIPROT:Q9HZ66
UNIPROT:P0AFY6
UNIPROT:P36683
UNIPROT:Q3J5L6
UNIPROT:Q9ZF99
UNIPROT:P45544
UNIPROT:P69054
UNIPROT:P51057
UNIPROT:G4V4G6
UNIPROT:P07014
UNIPROT:P14165
UNIPROT:P39647
UNIPROT:P0A6A6
UNIPROT:P04787
UNIPROT:Q2T7H6
UNIPROT:Q51344
UNIPROT:P07649
UNIPROT:Q56320
UNIPROT:Q2KE82
UNIPROT:Q5ZVY3
UNIPROT:P0A9Q5
UNIPROT:Q0P9D4
UNIPROT:Q795J3
UNIPROT:P0ABR7
UNIPROT:P71063
UNIPROT:P50941
UNIPROT:P99159
UNIPROT:Q8NXE2
UNIPROT:Q0P8T0
UNIPROT:Q5ZXH9
UNIPROT:Q7SIC0
UNIPROT:Q9X0C6
UNIPROT:Q0P8S7
UNIPROT:P54420
UNIPROT:Q56770
UNIPROT:Q2SXD6
UNIPROT:P19993
UNIPROT:P42597
UNIPROT:Q9HZK0
UNIPROT:Q9KPB6
UNIPROT:Q02HS5
UNIPROT:P0A7H3
UNIPROT:P06616
UNIPROT:P0A7Y0
UNIPROT:P00803
UNIPROT:P60785
UNIPROT:P73354
UNIPROT:P38108
UNIPROT:D0ZSY9
UNIPROT:Q9KQH9
UNIPROT:P80918
UNIPROT:Q9KQH7
UNIPROT:P0AAI9
UNIPROT:A1KRY9
UNIPROT:P27247
UNIPROT:Q9HZN4
UNIPROT:P0AB28
UNIPROT:P0A729
UNIPROT:P67087
UNIPROT:O34525
UNIPROT:Q9JMQ2
UNIPROT:P0AA39
UNIPROT:P21513
UNIPROT:P09127
UNIPROT:P48246
UNIPROT:P06983
UNIPROT:A0QWX4
UNIPROT:P20471
UNIPROT:P67127
UNIPROT:P0AGM0
UNIPROT:P0AE39
UNIPROT:P0AD14
UNIPROT:Q9LAE5
UNIPROT:P76578
UNIPROT:P76577
UNIPROT:P24559
UNIPROT:B3EWP1
UNIPROT:P0CL17
UNIPROT:P40719
UNIPROT:P77622
UNIPROT:P0AAG0
UNIPROT:P77463
UNIPROT:P75798
UNIPROT:P83772
UNIPROT:P69782
UNIPROT:P33950
UNIPROT:P37595
UNIPROT:Q5P5I4
UNIPROT:O07427
UNIPROT:O31158
UNIPROT:Q52011
UNIPROT:Q2YQ23
UNIPROT:O34723
UNIPROT:P0DOA0
UNIPROT:P0A9Q1
UNIPROT:P75797
UNIPROT:P48632
UNIPROT:P0AC62
UNIPROT:D0ZLR3
UNIPROT:P33232
UNIPROT:M1V1V5
UNIPROT:Q47155
UNIPROT:A1KFR7
UNIPROT:P52046
UNIPROT:Q9KTK0
UNIPROT:P0A8I3
UNIPROT:P0A6X1
UNIPROT:P0A7I0
UNIPROT:P0ACC1
UNIPROT:P0AC69
UNIPROT:P37665
UNIPROT:P45493
UNIPROT:P77622
UNIPROT:P76027
UNIPROT:P75799
UNIPROT:P75798
UNIPROT:P75797
UNIPROT:Q9I502
UNIPROT:P0A776
UNIPROT:Q2SZF9
UNIPROT:Q02GB1
UNIPROT:P0A7L8
UNIPROT:P0AG48
UNIPROT:P0AD57
UNIPROT:P37093
UNIPROT:P41441
UNIPROT:P31712
UNIPROT:P0A6I9
UNIPROT:Q87LT3
UNIPROT:P0A8H8
UNIPROT:P77788
UNIPROT:Q9Z4S1
UNIPROT:Q9PFB0
UNIPROT:O86042
UNIPROT:Q8YSQ6
UNIPROT:Q9HU16
UNIPROT:M9XB82
UNIPROT:Q1LRY0
UNIPROT:P32010
UNIPROT:P22731
UNIPROT:P0AEX7
UNIPROT:Q9RPT1
UNIPROT:P83736
UNIPROT:P0A8T1
UNIPROT:P37798
UNIPROT:P0ABD8
UNIPROT:Q81SZ9
UNIPROT:P37773
UNIPROT:P0AB91
UNIPROT:P0AGG8
UNIPROT:Q97PA9
UNIPROT:Q03228
UNIPROT:Q0KBC7
UNIPROT:P0ABG7
UNIPROT:P30850
UNIPROT:Q9KVT3
UNIPROT:P46022
UNIPROT:Q55168
UNIPROT:P0A814
UNIPROT:P47205
UNIPROT:P0A9A6
UNIPROT:Q9K0X8
UNIPROT:P04693
UNIPROT:A0QQ70
UNIPROT:O69061
UNIPROT:A0QQ68
UNIPROT:A4IP64
UNIPROT:P24197
UNIPROT:P77223
UNIPROT:P05704
UNIPROT:P43387
UNIPROT:Q3J4E3
UNIPROT:P9WPZ1
UNIPROT:P27253
UNIPROT:P0ACM2
UNIPROT:P45857
UNIPROT:P44839
UNIPROT:H6LGM6
UNIPROT:P9WNN7
UNIPROT:Q7X281
UNIPROT:A9GMG8
UNIPROT:P9WI85
UNIPROT:P32695
UNIPROT:P13800
UNIPROT:P0A8X2
UNIPROT:Q68AP4
UNIPROT:P21367
UNIPROT:P9WMG1
UNIPROT:P9WMG1
UNIPROT:P80880
UNIPROT:Q7A2Q1
UNIPROT:C1I202
UNIPROT:P9WMS1
UNIPROT:P9WMG1
UNIPROT:P0A3G4
UNIPROT:C0LA90
UNIPROT:Q5XW77
UNIPROT:B3EWP1
UNIPROT:P10151
UNIPROT:G4V4T7
UNIPROT:P75897
UNIPROT:Q6FFQ1
UNIPROT:P76584
UNIPROT:Q6BF17
UNIPROT:P9WG01
UNIPROT:P9WG03
UNIPROT:P10907
UNIPROT:P0ACL7
UNIPROT:P37772
UNIPROT:P39328
UNIPROT:Q8G847
UNIPROT:P39325
UNIPROT:P42329
UNIPROT:P52661
UNIPROT:P0AFS7
UNIPROT:Q51487
UNIPROT:P37683
UNIPROT:P0DOV5
UNIPROT:Q1JUP5
UNIPROT:P05149
UNIPROT:Q1JUQ0
UNIPROT:P14697
UNIPROT:Q8ZLF4
UNIPROT:P0AG80
UNIPROT:P29823
UNIPROT:P94530
UNIPROT:B3EWP1
UNIPROT:A9WGD2
UNIPROT:P14611
UNIPROT:P23608
UNIPROT:P33644
UNIPROT:O86043
UNIPROT:Q8GAL4
UNIPROT:Q05183
UNIPROT:Q44249
UNIPROT:P00259
UNIPROT:P37665
UNIPROT:P69782
UNIPROT:P80880
UNIPROT:Q7X284
UNIPROT:O86042
UNIPROT:Q315G1
UNIPROT:O34507
UNIPROT:P40138
UNIPROT:P33358
UNIPROT:Q51472
UNIPROT:P07395
UNIPROT:P08312
UNIPROT:Q6NDR6
UNIPROT:Q6NDR5
UNIPROT:P0A707
UNIPROT:P0A8M3
UNIPROT:P0A9G6
UNIPROT:P39759
UNIPROT:P96582
UNIPROT:P0AA70
UNIPROT:P0DP90
UNIPROT:Q88FX7
UNIPROT:B3EWP1
UNIPROT:Q8GAK6
UNIPROT:Q01563
UNIPROT:P52067
UNIPROT:P76241
UNIPROT:P00251
UNIPROT:P30855
UNIPROT:P0A2D5
UNIPROT:Q9PFB3
UNIPROT:Q9PFB2
UNIPROT:P54536
UNIPROT:O34671
UNIPROT:P27675
UNIPROT:P30860
UNIPROT:Q45515
UNIPROT:O67610
UNIPROT:Q1JUQ0
UNIPROT:P0A4T4
UNIPROT:Q6TMG4
UNIPROT:P9WIT1
UNIPROT:Q44185
UNIPROT:P0A4T4
UNIPROT:Q6SSJ6
UNIPROT:Q9I6Z0
UNIPROT:Q8U8Z6
UNIPROT:P76214
UNIPROT:O34817
UNIPROT:Q9HU83
UNIPROT:O31197
UNIPROT:P40400
UNIPROT:P40401
UNIPROT:P97027
UNIPROT:Q6RSN5
UNIPROT:P94681
UNIPROT:Q55169
UNIPROT:P63284
UNIPROT:Q9HWG9
UNIPROT:B3EWP1
UNIPROT:P52003
UNIPROT:P30749
UNIPROT:P30748
UNIPROT:P12281
UNIPROT:P44902
UNIPROT:P23669
UNIPROT:P19582
UNIPROT:P0A959
UNIPROT:Q8P9V9
UNIPROT:P9WIA3
UNIPROT:P0ACB0
UNIPROT:P0A7R1
UNIPROT:Q8ZK81
UNIPROT:P67675
UNIPROT:P02358
UNIPROT:P32705
UNIPROT:Q9K0I2
UNIPROT:P0A8A4
UNIPROT:Q02UU0
UNIPROT:P26829
UNIPROT:P22759
UNIPROT:P06971
UNIPROT:Q05202
UNIPROT:P42512
UNIPROT:Q44532
UNIPROT:P0C0F6
UNIPROT:Q9PMQ9
UNIPROT:P81549
UNIPROT:A3D8P6
UNIPROT:O52792
UNIPROT:P0ABU9
UNIPROT:P0ABV2
UNIPROT:P0AAC6
UNIPROT:P55135
UNIPROT:Q9KI19
UNIPROT:P39700
UNIPROT:A5F9C1
UNIPROT:Q9KI21
UNIPROT:P12311
UNIPROT:P14565
UNIPROT:P09130
UNIPROT:P0ADY1
UNIPROT:P0ACF4
UNIPROT:P76251
UNIPROT:Q01398
UNIPROT:P76250
UNIPROT:P0ABF8
UNIPROT:P0A8G0
UNIPROT:Q9HZZ2
UNIPROT:P58000
UNIPROT:A5F120
UNIPROT:P68206
UNIPROT:Q5XW77
UNIPROT:P58329
UNIPROT:P27111
UNIPROT:Q2YKI6
UNIPROT:P37388
UNIPROT:Q70I53
UNIPROT:P9WIY9
UNIPROT:P9WGQ7
UNIPROT:P54422
UNIPROT:P45766
UNIPROT:O34671
UNIPROT:P45768
UNIPROT:P27675
UNIPROT:B3EWP1
UNIPROT:Q9I5W0
UNIPROT:P00579
UNIPROT:P0AFI5
UNIPROT:Q9KKU0
UNIPROT:P46336
UNIPROT:P76630
UNIPROT:Q9KWL3
UNIPROT:P22636
UNIPROT:P22635
UNIPROT:A0A0K8P8E7
UNIPROT:O87170
UNIPROT:Q9AQI0
UNIPROT:P00959
UNIPROT:P0AFR2
UNIPROT:P35154
UNIPROT:Q8ZMX9
UNIPROT:P46828
UNIPROT:P44542
UNIPROT:P44543
UNIPROT:Q6ND47
UNIPROT:Q4KBD3
UNIPROT:Q9L9I7
UNIPROT:Q5L2C2
UNIPROT:O32583
UNIPROT:Q5SKG7
UNIPROT:P30137
UNIPROT:P11458
UNIPROT:P30012
UNIPROT:P9WG67
UNIPROT:Q8YSQ6
UNIPROT:P64191
UNIPROT:P0AG86
UNIPROT:P0AC62
UNIPROT:P0AG27
UNIPROT:Q3JWH7
UNIPROT:Q7A5M9
UNIPROT:P12282
UNIPROT:Q1QWN5
UNIPROT:P0AED5
UNIPROT:Q99027
UNIPROT:P0AFH8
UNIPROT:P9WL95
UNIPROT:P0AFH8
UNIPROT:P9WGF7
UNIPROT:Q8XCU6
UNIPROT:P75990
UNIPROT:Q8Y5X1
UNIPROT:P0AA84
UNIPROT:P0AFC0
UNIPROT:Q51422
UNIPROT:P94392
UNIPROT:P0A6A0
UNIPROT:P0A887
UNIPROT:P35077
UNIPROT:P9WIT1
UNIPROT:P0AG80
UNIPROT:P10905
UNIPROT:P94530
UNIPROT:Q8ZLF4
UNIPROT:P10908
UNIPROT:P0A996
UNIPROT:Q9AGG3
UNIPROT:Q58PK7
UNIPROT:P26391
UNIPROT:P37778
UNIPROT:P26393
UNIPROT:Q9HU21
UNIPROT:Q9HXH0
UNIPROT:Q1QWN6
UNIPROT:Q05852
UNIPROT:D3RPC0
UNIPROT:P32664
UNIPROT:P29848
UNIPROT:Q7A3D6
UNIPROT:O52787
UNIPROT:P76387
UNIPROT:P77377
UNIPROT:P71065
UNIPROT:A0QQZ8
UNIPROT:O54068
UNIPROT:P71241
UNIPROT:P46915
UNIPROT:P72780
UNIPROT:Q66DP5
UNIPROT:B3EWP1
UNIPROT:P06621
UNIPROT:P22731
UNIPROT:P45073
UNIPROT:P0AEX7
UNIPROT:P0AAZ4
UNIPROT:Q9I0M4
UNIPROT:Q9I0M3
UNIPROT:P55222
UNIPROT:P43788
UNIPROT:P0A7M2
UNIPROT:P0A7N9
UNIPROT:Q5SII2
UNIPROT:P08179
UNIPROT:P29217
UNIPROT:P95280
UNIPROT:P45867
UNIPROT:P37957
UNIPROT:P0AG40
UNIPROT:P00956
UNIPROT:P00804
UNIPROT:P28304
UNIPROT:Q9KQH7
UNIPROT:P66948
UNIPROT:P37093
UNIPROT:P0C0S1
UNIPROT:P39408
UNIPROT:P0C0V0
UNIPROT:P9WLX5
UNIPROT:P9WFJ7
UNIPROT:P75957
UNIPROT:P75958
UNIPROT:P21893
UNIPROT:A9AKH1
UNIPROT:Q7A3D6
UNIPROT:Q2T083
UNIPROT:P30847
UNIPROT:P69229
UNIPROT:P76641
UNIPROT:Q2YKI6
UNIPROT:Q9I6Y4
UNIPROT:Q2YKI6
UNIPROT:P04983
UNIPROT:P55700
UNIPROT:Q0QLF4
UNIPROT:Q46799
UNIPROT:P95276
UNIPROT:P63486
UNIPROT:P52679
UNIPROT:P74917
UNIPROT:B3EWP1
UNIPROT:P56653
UNIPROT:P16528
UNIPROT:Q44472
UNIPROT:Q07162
UNIPROT:Q986N2
UNIPROT:Q9I6Z0
UNIPROT:Q4VYA5
UNIPROT:P0ACM2
UNIPROT:B5ZA76
UNIPROT:P37113
UNIPROT:P23908
UNIPROT:O84147
UNIPROT:P0A5N7
UNIPROT:P45947
UNIPROT:P16528
UNIPROT:P37676
UNIPROT:Q5SIL5
UNIPROT:P0DOV6
UNIPROT:P44992
UNIPROT:B3EWP1
UNIPROT:P22731
UNIPROT:P0A9V1
UNIPROT:P0AEX7
UNIPROT:P0AD96
UNIPROT:A9X6P9
UNIPROT:P45858
UNIPROT:Q6F4M7
UNIPROT:P27111
UNIPROT:P39829
UNIPROT:P77526
UNIPROT:J7QLC0
UNIPROT:P77379
UNIPROT:P76250
UNIPROT:P0AEP7
UNIPROT:P30147
UNIPROT:P77161
UNIPROT:P44658
UNIPROT:Q47538
UNIPROT:A9WGD1
UNIPROT:O34598
UNIPROT:Q09063
UNIPROT:P22731
UNIPROT:P45073
UNIPROT:P0AEX7
UNIPROT:P27017
UNIPROT:Q03736
UNIPROT:A0A0K8P8E7
UNIPROT:O31672
UNIPROT:Q74Y93
UNIPROT:P9WNP3
UNIPROT:Q72H90
UNIPROT:P0A754
UNIPROT:P03819
UNIPROT:P0AEC6
UNIPROT:Q9I4D4
UNIPROT:Q43974
UNIPROT:Q93T12
UNIPROT:P0A102
UNIPROT:Q01103
UNIPROT:Q52154
UNIPROT:Q52369
UNIPROT:P96579
UNIPROT:P0ADE4
UNIPROT:D2TN57
UNIPROT:P26647
UNIPROT:Q59712
UNIPROT:P56585
UNIPROT:Q5SK65
UNIPROT:P0A8I1
UNIPROT:Q8EBZ9
UNIPROT:Q55081
UNIPROT:P07363
UNIPROT:G3XD24
UNIPROT:P0AFJ5
UNIPROT:P37478
UNIPROT:P42453
UNIPROT:P76422
UNIPROT:P48247
UNIPROT:P15639
UNIPROT:P0A6R3
UNIPROT:P9WNS7
UNIPROT:P0ADR0
UNIPROT:P0ABU2
UNIPROT:P0AAP1
UNIPROT:P75863
UNIPROT:P25535
UNIPROT:Q7VU58
UNIPROT:Q9L9C1
UNIPROT:P76084
UNIPROT:P77467
UNIPROT:P9WNN9
UNIPROT:B3EWP1
UNIPROT:P0AEI1
UNIPROT:P52101
UNIPROT:P0AFU4
UNIPROT:P33136
UNIPROT:P62517
UNIPROT:P0AGD7
UNIPROT:P64432
UNIPROT:Q06067
UNIPROT:P14375
UNIPROT:P13016
UNIPROT:P00452
UNIPROT:O84835
UNIPROT:Q9Z8F9
UNIPROT:P21693
UNIPROT:P44760
UNIPROT:Q180E3
UNIPROT:P77526
UNIPROT:P76250
UNIPROT:D0VWY5
UNIPROT:P0AE12
UNIPROT:P0A8K1
UNIPROT:Q8GB17
UNIPROT:O34777
UNIPROT:P80242
UNIPROT:P0C0K3
UNIPROT:O06724
UNIPROT:B3EWP1
UNIPROT:B3EWP1
UNIPROT:A0A0A1H8I4
UNIPROT:P0A6Z3
UNIPROT:P40719
UNIPROT:P52076
UNIPROT:P26949
UNIPROT:P32684
UNIPROT:Q5SIL0
UNIPROT:O53521
UNIPROT:P14788
UNIPROT:P0AEX7
UNIPROT:P22731
UNIPROT:Q72K16
UNIPROT:B3EWP1
UNIPROT:B3EWP1
UNIPROT:O69687
UNIPROT:Q8GQN9
UNIPROT:P45780
UNIPROT:Q9KKU0
UNIPROT:B8H164
UNIPROT:P42385
UNIPROT:P64592
UNIPROT:Q9KUY4
UNIPROT:K0BE10
UNIPROT:P69874
UNIPROT:P27370
UNIPROT:P43951
UNIPROT:O68282
UNIPROT:Q88P32
UNIPROT:P24137
UNIPROT:P0AAG0
UNIPROT:P0AEG1
UNIPROT:P23847
UNIPROT:P45493
UNIPROT:P75798
UNIPROT:P24305
UNIPROT:Q9I5R6
UNIPROT:P0ADX1
UNIPROT:P04968
UNIPROT:P36561
UNIPROT:D3DFP8
UNIPROT:Q00514
UNIPROT:P41443
UNIPROT:Q00516
UNIPROT:P35818
UNIPROT:Q9I5N9
UNIPROT:P9WQ37
UNIPROT:Q55168
UNIPROT:P53608
UNIPROT:Q8GBW6
UNIPROT:P16421
UNIPROT:P76114
UNIPROT:P43951
UNIPROT:P96065
UNIPROT:P69874
UNIPROT:P37967
UNIPROT:A9X6P9
UNIPROT:P52046
UNIPROT:P52046
UNIPROT:B3EWP1
UNIPROT:Q59I44
UNIPROT:P30347
UNIPROT:P09980
UNIPROT:Q9K0U7
UNIPROT:O34726
UNIPROT:P37454
UNIPROT:P08870
UNIPROT:Q9HVT7
UNIPROT:Q9HVT8
UNIPROT:Q9HVT9
UNIPROT:P0A9X4
UNIPROT:P16926
UNIPROT:P0AD65
UNIPROT:A8AVK0
UNIPROT:P27111
UNIPROT:A8IKD2
UNIPROT:A1TFU9
UNIPROT:P9WQM1
UNIPROT:A9WGD1
UNIPROT:P0AF20
UNIPROT:P09099
UNIPROT:P77376
UNIPROT:A0QYB5
UNIPROT:P0AGI1
UNIPROT:P04983
UNIPROT:P45687
UNIPROT:P0ACI3
UNIPROT:P0A1B7
UNIPROT:Q2STE9
UNIPROT:P0ABA6
UNIPROT:P12985
UNIPROT:P0ABA4
UNIPROT:P0ABA0
UNIPROT:P68699
UNIPROT:P0AB98
UNIPROT:P9WKP3
UNIPROT:P99090
UNIPROT:P0A8J4
UNIPROT:P60720
UNIPROT:P60716
UNIPROT:P43951
UNIPROT:P23533
UNIPROT:O06976
UNIPROT:P77726
UNIPROT:A9X6P9
UNIPROT:C0LA90
UNIPROT:Q81SZ9
UNIPROT:P00138
UNIPROT:P0AAM1
UNIPROT:P73728
UNIPROT:P0AG63
UNIPROT:Q9HWE3
UNIPROT:P0ADY7
UNIPROT:P0A7V3
UNIPROT:Q7DDT5
UNIPROT:P0A7U3
UNIPROT:P60422
UNIPROT:Q9K1I6
UNIPROT:P60723
UNIPROT:P60438
UNIPROT:P0A7R5
UNIPROT:P74250
UNIPROT:P0ACA1
UNIPROT:P0AB89
UNIPROT:P0AA97
UNIPROT:P0AEC0
UNIPROT:P66948
UNIPROT:P0A738
UNIPROT:A0R043
UNIPROT:P26401
UNIPROT:Q8KZ94
UNIPROT:P9WMY1
UNIPROT:P57039
UNIPROT:P37093
UNIPROT:P40137
UNIPROT:A0QYG2
UNIPROT:O34779
UNIPROT:O34507
UNIPROT:P0AEC0
UNIPROT:P80865
UNIPROT:P80886
UNIPROT:Q8ZKL8
UNIPROT:Q8P9X1
UNIPROT:Q9F672
UNIPROT:P0CL17
UNIPROT:Q8X524
UNIPROT:P0A9D2
UNIPROT:P06709
UNIPROT:P14294
UNIPROT:P44602
UNIPROT:P44602
UNIPROT:Q5SKC5
UNIPROT:P0A944
UNIPROT:P76256
UNIPROT:P24228
UNIPROT:Q89G85
UNIPROT:P39153
UNIPROT:Q7A695
UNIPROT:P9WHM1
UNIPROT:P0A4L2
UNIPROT:A0R4I0
UNIPROT:Q56815
UNIPROT:P21599
UNIPROT:P43726
UNIPROT:P23882
UNIPROT:A0A0H3KB98
UNIPROT:Q8DPI7
UNIPROT:E9RGS4
UNIPROT:E9RGS3
UNIPROT:P0ADZ7
UNIPROT:P0ACJ0
UNIPROT:P0A935
UNIPROT:P10958
UNIPROT:P10955
UNIPROT:P0A9F6
UNIPROT:P9WHM3
UNIPROT:Q9I700
UNIPROT:P76268
UNIPROT:O06724
UNIPROT:Q44492
UNIPROT:P31677
UNIPROT:A0R0W9
UNIPROT:P31678
UNIPROT:P37902
UNIPROT:P9WNN7
UNIPROT:P9WKS7
UNIPROT:P0ACM2
UNIPROT:P75897
UNIPROT:P04983
UNIPROT:P75897
UNIPROT:P59701
UNIPROT:Q2YKI6
UNIPROT:B3EWP1
UNIPROT:Q936X3
UNIPROT:Q47538
UNIPROT:P40401
UNIPROT:Q936X2
UNIPROT:B3EWP1
UNIPROT:P39376
UNIPROT:B3EWP1
UNIPROT:P76038
UNIPROT:P9WGJ3
UNIPROT:P94428
UNIPROT:O33341
UNIPROT:Q59477
UNIPROT:P76112
UNIPROT:Q936T0
UNIPROT:Q9I434
UNIPROT:P31135
UNIPROT:P14788
UNIPROT:P43951
UNIPROT:Q8GAL4
UNIPROT:Q51782
UNIPROT:Q2BN77
UNIPROT:Q8GAI3
UNIPROT:Q9KNX6
UNIPROT:Q2YKI6
UNIPROT:P0ACM2
UNIPROT:P9WI85
UNIPROT:B7UTB3
UNIPROT:P37973
UNIPROT:O31101
UNIPROT:P30870
UNIPROT:P0DP66
UNIPROT:Q5SLX2
UNIPROT:P9WQ35
UNIPROT:P0A9B2
UNIPROT:P27302
UNIPROT:Q88QT2
UNIPROT:P15034
UNIPROT:P57039
UNIPROT:P76558
UNIPROT:P00650
UNIPROT:Q63X76
UNIPROT:P0ABZ8
UNIPROT:Q9K187
UNIPROT:Q88QT3
UNIPROT:P0AGL7
UNIPROT:A6X7E7
UNIPROT:A0QNZ7
UNIPROT:P0AEC6
UNIPROT:O31533
UNIPROT:Q0KBD1
UNIPROT:Q0KBC9
UNIPROT:A0A0H3KP73
UNIPROT:Q0KBD2
UNIPROT:Q0KBC7
UNIPROT:O07007
UNIPROT:P17551
UNIPROT:P17552
UNIPROT:Q7UG04
UNIPROT:B7X2B6
UNIPROT:A0A0H3AJF5
UNIPROT:P9WGR7
UNIPROT:P67653
UNIPROT:Q9HVW0
UNIPROT:P03819
UNIPROT:P03004
UNIPROT:Q9I7C4
UNIPROT:P0A2I3
UNIPROT:P9WNY7
UNIPROT:P07639
UNIPROT:P0A6D7
UNIPROT:Q70M91
UNIPROT:O05131
UNIPROT:B4E5Z6
UNIPROT:Q9KVL7
UNIPROT:P76343
UNIPROT:P76342
UNIPROT:P72978
UNIPROT:P73912
UNIPROT:P00106
UNIPROT:Q2SU58
UNIPROT:I6YGS0
UNIPROT:P0ACV0
UNIPROT:P0ACV0
UNIPROT:Q63YH5
UNIPROT:P16918
UNIPROT:Q9I1X7
UNIPROT:D3DFP8
UNIPROT:Q9X024
UNIPROT:P76250
UNIPROT:P9WK87
UNIPROT:Q8FWV8
UNIPROT:Q8G2M6
UNIPROT:Q51487
UNIPROT:Q5FA19
UNIPROT:P96064
UNIPROT:P72827
UNIPROT:P0A9H3
UNIPROT:Q8L3K8
UNIPROT:P24305
UNIPROT:P17169
UNIPROT:P0ACJ0
UNIPROT:B0VPT6
UNIPROT:P04190
UNIPROT:P46859
UNIPROT:A0R4Z6
UNIPROT:P0A9F6
UNIPROT:P44905
UNIPROT:P0AGC3
UNIPROT:Q55026
UNIPROT:P77544
UNIPROT:Q04796
UNIPROT:P06961
UNIPROT:P44541
UNIPROT:P37769
UNIPROT:Q8ZP88
UNIPROT:D3DFG8
UNIPROT:O82872
UNIPROT:P0A812
UNIPROT:P0A809
UNIPROT:P0A9K3
UNIPROT:P71335
UNIPROT:P44814
UNIPROT:P69380
UNIPROT:Q8KIY1
UNIPROT:P10958
UNIPROT:P41256
UNIPROT:P0AFS9
UNIPROT:Q9I5Q5
UNIPROT:P45344
UNIPROT:P0A7X3
UNIPROT:P0AA10
UNIPROT:P37634
UNIPROT:P76077
UNIPROT:P76078
UNIPROT:P76079
UNIPROT:P76080
UNIPROT:P76081
UNIPROT:A0A0H2ZFV1
UNIPROT:P0AEZ1
UNIPROT:P9WJ62
UNIPROT:Q3JY79
UNIPROT:L0TDE1
UNIPROT:Q83DC7
UNIPROT:B3EWP1
UNIPROT:O07007
UNIPROT:B3EWP1
UNIPROT:Q9I6Z0
UNIPROT:O34873
UNIPROT:A9X6P9
UNIPROT:Q88FY5
UNIPROT:O07007
UNIPROT:P37387
UNIPROT:P0AGI4
UNIPROT:Q9F9B0
UNIPROT:P0AF20
UNIPROT:Q6HMS8
UNIPROT:B3EWP1
UNIPROT:P54989
UNIPROT:Q06750
UNIPROT:P46841
UNIPROT:Q9EXP2
UNIPROT:P45857
UNIPROT:P97027
UNIPROT:Q55462
UNIPROT:Q55106
UNIPROT:P96608
UNIPROT:P22731
UNIPROT:P45073
UNIPROT:P0AEX7
UNIPROT:Q8YEE8
UNIPROT:Q9EUT9
UNIPROT:P0AEM9
UNIPROT:P0AEM9
UNIPROT:P27675
UNIPROT:P45493
UNIPROT:O34974
UNIPROT:Q9EUT9
UNIPROT:Q47083
UNIPROT:P31547
UNIPROT:Q87RS1
UNIPROT:P28635
UNIPROT:P28635
UNIPROT:E9JFX9
UNIPROT:P54998
UNIPROT:P54998
UNIPROT:Q9I6H0
UNIPROT:P0A944
UNIPROT:P9WQK7
UNIPROT:Q8YEE8
UNIPROT:P0C0L7
UNIPROT:Q9X9Q7
UNIPROT:P45202
UNIPROT:P0AG34
UNIPROT:Q9I2A0
UNIPROT:Q8ZQ30
UNIPROT:P9WPQ3
UNIPROT:Q7A6A9
UNIPROT:P45857
UNIPROT:Q8GBW6
UNIPROT:Q02458
UNIPROT:P74268
UNIPROT:P76250
UNIPROT:P9WQI5
UNIPROT:Q7BS32
UNIPROT:P31467
UNIPROT:P50733
UNIPROT:P37052
UNIPROT:P45857
UNIPROT:P06621
UNIPROT:P21158
UNIPROT:P45359
UNIPROT:Q8X607
UNIPROT:P61517
UNIPROT:P45857
UNIPROT:P22186
UNIPROT:P0ABS1
UNIPROT:P24203
UNIPROT:P75876
UNIPROT:Q9I310
UNIPROT:O31207
UNIPROT:P0A6K1
UNIPROT:P35159
UNIPROT:P39338
UNIPROT:P25894
UNIPROT:P45577
UNIPROT:Q9PA93
UNIPROT:Q9KL24
UNIPROT:O34677
UNIPROT:P0AER5
UNIPROT:P54953
UNIPROT:P37902
UNIPROT:P0AC38
UNIPROT:P39376
UNIPROT:P20668
UNIPROT:Q7BS32
UNIPROT:C0LA90
UNIPROT:A9WGD2
UNIPROT:Q55463
UNIPROT:A9WGD1
UNIPROT:O32145
UNIPROT:O32144
UNIPROT:O32143
UNIPROT:Q2G506
UNIPROT:P39062
UNIPROT:A9X6P9
UNIPROT:B3EWP1
UNIPROT:P0ACN4
UNIPROT:P44886
UNIPROT:G4V4T7
UNIPROT:P0AEX7
UNIPROT:P45073
UNIPROT:P22731
UNIPROT:O53521
UNIPROT:P21175
UNIPROT:P21175
UNIPROT:P77379
UNIPROT:P0ADL1
UNIPROT:P76113
UNIPROT:Q9KGL3
UNIPROT:P37486
UNIPROT:Q54818
UNIPROT:Q5ZRP2
UNIPROT:P0AA37
UNIPROT:O69052
UNIPROT:P0AFS7
UNIPROT:Q5KY94
UNIPROT:P77333
UNIPROT:P0A0H9
UNIPROT:P9WPQ3
UNIPROT:P45867
UNIPROT:Q8GBW6
UNIPROT:O34717
UNIPROT:Q0S7V2
UNIPROT:P9WGH3
UNIPROT:P0AGM2
UNIPROT:Q73SC8
UNIPROT:P77467
UNIPROT:O32177
UNIPROT:O32178
UNIPROT:Q5LLW7
UNIPROT:Q7A3D6
UNIPROT:Q39BA7
UNIPROT:Q1NEI6
UNIPROT:Q8P3K4
UNIPROT:P0ACL7
UNIPROT:P44542
UNIPROT:Q9KR65
UNIPROT:Q9KR66
UNIPROT:P39829
UNIPROT:P30178
UNIPROT:Q9KVD2
UNIPROT:P45336
UNIPROT:P52076
UNIPROT:Q9HTN2
UNIPROT:O34753
UNIPROT:P07874
UNIPROT:P42953
UNIPROT:Q7A713
UNIPROT:A0QUV5
UNIPROT:A7TZT2
UNIPROT:Q44493
UNIPROT:O05979
UNIPROT:P35828
UNIPROT:Q7ANN4
UNIPROT:Q7ANN4
UNIPROT:Q7ANN5
UNIPROT:P02930
UNIPROT:P9WMH2
UNIPROT:P40137
UNIPROT:Q9JRN5
UNIPROT:Q9JRN7
UNIPROT:P9WMY9
UNIPROT:A0R043
UNIPROT:Q6F7B8
UNIPROT:Q8YS92
UNIPROT:P9WKC9
UNIPROT:Q1KLK0
UNIPROT:P18775
UNIPROT:P04816
UNIPROT:O68873
UNIPROT:P45493
UNIPROT:B4E9P3
UNIPROT:A0A0N8YGA2
UNIPROT:A7N6S2
UNIPROT:P10408
UNIPROT:P0AFS9
UNIPROT:P45206
UNIPROT:P0ACA3
UNIPROT:Q82W83
UNIPROT:Q02761
UNIPROT:P22731
UNIPROT:P45073
UNIPROT:P0AEX7
UNIPROT:Q8YEE8
UNIPROT:P9WNN9
UNIPROT:Q5P5G5
UNIPROT:Q5P5G4
UNIPROT:O07610
UNIPROT:P9WJW5
UNIPROT:P9WJW3
UNIPROT:Q02E40
UNIPROT:P24173
UNIPROT:Q45374
UNIPROT:Q9K2Y1
UNIPROT:D3RPB9
UNIPROT:P0A7A5
UNIPROT:C4ZSG4
UNIPROT:P13510
UNIPROT:P0ACU2
UNIPROT:P14930
UNIPROT:P0AAF6
UNIPROT:Q9JZN9
UNIPROT:O32142
UNIPROT:Q69HT9
UNIPROT:P22862
UNIPROT:P37508
UNIPROT:P21159
UNIPROT:Q51697
UNIPROT:Q51698
UNIPROT:O32146
UNIPROT:O31851
UNIPROT:P39163
UNIPROT:P21909
UNIPROT:P0A955
UNIPROT:Q68AP4
UNIPROT:P24077
UNIPROT:P21367
UNIPROT:P75919
UNIPROT:O07552
UNIPROT:E1VBK4
UNIPROT:P32113
UNIPROT:D9IA43
UNIPROT:Q5GCA5
UNIPROT:P52636
UNIPROT:Q54794
UNIPROT:P0A9E5
UNIPROT:P77915
UNIPROT:P0AC81
UNIPROT:B3EWP1
UNIPROT:P28820
UNIPROT:P0CI72
UNIPROT:H9N289
UNIPROT:H9N289
UNIPROT:A9WGD2
UNIPROT:Q9KWS2
UNIPROT:P9WIT1
UNIPROT:A9WGD1
UNIPROT:Q55463
UNIPROT:A9WGD2
UNIPROT:P83772
UNIPROT:B9J8S0
UNIPROT:P0ACU2
UNIPROT:P45382
UNIPROT:P44556
UNIPROT:P39315
UNIPROT:P77379
UNIPROT:P37662
UNIPROT:Q2G140
UNIPROT:Q1JUP4
UNIPROT:P39640
UNIPROT:Q6BF16
UNIPROT:Q92RN7
UNIPROT:P45416
UNIPROT:P37552
UNIPROT:P42061
UNIPROT:P75798
UNIPROT:P77463
UNIPROT:P24136
UNIPROT:P24137
UNIPROT:O82872
UNIPROT:Q88P32
UNIPROT:P72349
UNIPROT:Q1JUQ1
UNIPROT:P0ACL7
UNIPROT:P37674
UNIPROT:Q9KR66
UNIPROT:A1WPV4
UNIPROT:Q6BF17
UNIPROT:P0AG38
UNIPROT:P25888
UNIPROT:P0A9W3
UNIPROT:P9WM23
UNIPROT:P32113
UNIPROT:Q9X5X4
UNIPROT:P0ACS5
UNIPROT:Q1LK00
UNIPROT:P83788
UNIPROT:P0C8P4
UNIPROT:P11724
UNIPROT:O66442
UNIPROT:P27273
UNIPROT:P0AEL0
UNIPROT:O87816
UNIPROT:P9WKP3
UNIPROT:P16456
UNIPROT:P33667
UNIPROT:P9WI78
UNIPROT:O31607
UNIPROT:A8AVK0
UNIPROT:O30795
UNIPROT:P76053
UNIPROT:Q02170
UNIPROT:P0AEM0
UNIPROT:P62623
UNIPROT:Q10725
UNIPROT:P0A8L1
UNIPROT:Q9I1W5
UNIPROT:P0DMC6
UNIPROT:P13800
UNIPROT:Q9PIB4
UNIPROT:P0AFR4
UNIPROT:A6WUW2
UNIPROT:Q7NXD4
UNIPROT:P0ACA7
UNIPROT:Q81NT5
UNIPROT:P9WL85
UNIPROT:Q14SY0
UNIPROT:P25888
UNIPROT:P05637
UNIPROT:P10799
UNIPROT:P0AED5
UNIPROT:Q7WZY5
UNIPROT:P71403
UNIPROT:P07363
UNIPROT:P02942
UNIPROT:P0A964
UNIPROT:P02941
UNIPROT:P50466
UNIPROT:P07801
UNIPROT:Q9X005
UNIPROT:P07330
UNIPROT:Q75XW3
UNIPROT:P54700
UNIPROT:P0A1L1
UNIPROT:P26419
UNIPROT:P06974
UNIPROT:P26416
UNIPROT:P0A1K1
UNIPROT:P26465
UNIPROT:P15934
UNIPROT:P0A1J9
UNIPROT:P15928
UNIPROT:P26462
UNIPROT:P26609
UNIPROT:Q9K3C5
UNIPROT:P21184
UNIPROT:P21184
UNIPROT:O34201
UNIPROT:O52222
UNIPROT:P09348
UNIPROT:P0AF06
UNIPROT:P0A2D5
UNIPROT:C6EBU6
UNIPROT:P40727
UNIPROT:P40729
UNIPROT:Q01960
UNIPROT:P0AEM6
UNIPROT:O07084
UNIPROT:P40131
UNIPROT:P16437
UNIPROT:P0A1I7
UNIPROT:P0A1I9
UNIPROT:P0A1J1
UNIPROT:P16323
UNIPROT:P0A1J3
UNIPROT:P15931
UNIPROT:P0A1J5
UNIPROT:P16326
UNIPROT:P0AE45
UNIPROT:P45370
UNIPROT:P0AFS7
UNIPROT:Q9X0L4
UNIPROT:Q9WYB1
UNIPROT:O67505
UNIPROT:P76594
UNIPROT:P0A993
UNIPROT:P04825
UNIPROT:P77475
UNIPROT:P07012
UNIPROT:Q2G2U4
UNIPROT:Q9WYN9
UNIPROT:P31105
UNIPROT:P0C0F6
UNIPROT:P9WNH3
UNIPROT:P39062
UNIPROT:P0A9T4
UNIPROT:P45202
UNIPROT:P60782
UNIPROT:P44096
UNIPROT:Q9HZM7
UNIPROT:Q7VYU0
UNIPROT:A0R610
UNIPROT:A0R607
UNIPROT:P9WFR5
UNIPROT:Q7VTJ9
UNIPROT:Q6FF51
UNIPROT:P24178
UNIPROT:P08192
UNIPROT:P08550
UNIPROT:P0AG16
UNIPROT:P55218
UNIPROT:Q2SX36
UNIPROT:D4ZX35
UNIPROT:P13267
UNIPROT:Q9WYC4
UNIPROT:O86109
UNIPROT:O86109
UNIPROT:P02941
UNIPROT:P77549
UNIPROT:O31675
UNIPROT:P05704
UNIPROT:P58152
UNIPROT:Q7DDS7
UNIPROT:P65359
UNIPROT:P75770
UNIPROT:P0AA84
UNIPROT:P77258
UNIPROT:P0A9D2
UNIPROT:P9WL93
UNIPROT:Q8GHB1
UNIPROT:P9WMH2
UNIPROT:B1Q2A8
UNIPROT:P0AD96
UNIPROT:P0AEX7
UNIPROT:P9WQM1
UNIPROT:P22731
UNIPROT:O05619
UNIPROT:P0AFT8
UNIPROT:Q88FF8
UNIPROT:Q8XAY0
UNIPROT:P0A6U5
UNIPROT:P76249
UNIPROT:P37522
UNIPROT:Q83AH2
UNIPROT:Q7WZY4
UNIPROT:O34757
UNIPROT:P9WIT1
UNIPROT:Q1LJ80
UNIPROT:P45083
UNIPROT:P75892
UNIPROT:Q89FH0
UNIPROT:B3EWP1
UNIPROT:O68014
UNIPROT:P9WQD1
UNIPROT:Q0P9X8
UNIPROT:Q04064
UNIPROT:B3EWP1
UNIPROT:P99149
UNIPROT:B3EWP1
UNIPROT:P94678
UNIPROT:P27111
UNIPROT:B3EWP1
UNIPROT:P74840
UNIPROT:A9X6P9
UNIPROT:P52046
UNIPROT:Q7TXL8
UNIPROT:B3EWP1
UNIPROT:P76250
UNIPROT:O34296
UNIPROT:B3EWP1
UNIPROT:P77489
UNIPROT:P77324
UNIPROT:P77165
UNIPROT:P76250
UNIPROT:P27111
UNIPROT:B3EWP1
UNIPROT:P0A2C9
UNIPROT:Q0QLF5
UNIPROT:Q88FY0
UNIPROT:Q6Q271
UNIPROT:Q88FY1
UNIPROT:P00632
UNIPROT:P96608
UNIPROT:P0ACV6
UNIPROT:P96726
UNIPROT:Q9HTQ0
UNIPROT:P9WMF5
UNIPROT:P04252
UNIPROT:P0AFU4
UNIPROT:E0X9C7
UNIPROT:P55619
UNIPROT:Q9ZFG9
UNIPROT:P21175
UNIPROT:O86436
UNIPROT:P0AF98
UNIPROT:P0ADC6
UNIPROT:O87690
UNIPROT:P45600
UNIPROT:P77806
UNIPROT:Q7SIF3
UNIPROT:Q0PB48
UNIPROT:P0AEC3
UNIPROT:P30863
UNIPROT:P0A6C5
UNIPROT:P9WHK0
UNIPROT:P9WLM0
UNIPROT:P0A6M2
UNIPROT:P14788
UNIPROT:P0AEB0
UNIPROT:P27367
UNIPROT:P0AG78
UNIPROT:Q9L3Q5
UNIPROT:O83046
UNIPROT:P00438
UNIPROT:Q1MW86
UNIPROT:P9WQG3
UNIPROT:Q93AM1
UNIPROT:P19219
UNIPROT:Q3S8Q4
UNIPROT:Q9HV27
UNIPROT:V5YMB3
UNIPROT:P09030
UNIPROT:P9WFN3
UNIPROT:Q05202
UNIPROT:B3EWP1
UNIPROT:O34873
UNIPROT:A9X6P9
UNIPROT:P77743
UNIPROT:Q7BBW0
UNIPROT:P17904
UNIPROT:Q9WVX8
UNIPROT:P9WIY9
UNIPROT:B8GZM2
UNIPROT:P0AEC6
UNIPROT:P0AEC6
UNIPROT:Q7WLU9
UNIPROT:P9WIY7
UNIPROT:Q8GHB1
UNIPROT:P43531
UNIPROT:P37894
UNIPROT:P9WG91
UNIPROT:B3EWP1
UNIPROT:P83310
UNIPROT:P9WQN3
UNIPROT:P0AAQ9
UNIPROT:P0A0J9
UNIPROT:O32242
UNIPROT:P45857
UNIPROT:O53871
UNIPROT:P77399
UNIPROT:P27245
UNIPROT:B0B8F4
UNIPROT:P17551
UNIPROT:P14285
UNIPROT:C4XNQ9
UNIPROT:P0A9T4
UNIPROT:P63177
UNIPROT:P54735
UNIPROT:O34392
UNIPROT:Q9HZY8
UNIPROT:Q07703
UNIPROT:Q2SWD3
UNIPROT:Q4FTX3
UNIPROT:P0ABC3
UNIPROT:P0ABC7
UNIPROT:P25519
UNIPROT:P0A6X3
UNIPROT:Q9XCI8
UNIPROT:Q9HXJ7
UNIPROT:P43989
UNIPROT:Q2SWE3
UNIPROT:Q72H18
UNIPROT:P36979
UNIPROT:Q2SWE7
UNIPROT:P37765
UNIPROT:Q83CP9
UNIPROT:P65836
UNIPROT:Q9K0B1
UNIPROT:P27296
UNIPROT:O34779
UNIPROT:P64612
UNIPROT:P14218
UNIPROT:P20708
UNIPROT:P39695
UNIPROT:P9WL93
UNIPROT:P9WL97
UNIPROT:P9WL93
UNIPROT:P0AFI2
UNIPROT:P0AGC3
UNIPROT:Q9KL03
UNIPROT:P0A2I5
UNIPROT:O34456
UNIPROT:Q7A3L9
UNIPROT:Q9I4W3
UNIPROT:Q9JZR5
UNIPROT:C0SP91
UNIPROT:P27431
UNIPROT:Q7CFU4
UNIPROT:Q2T0H6
UNIPROT:P0ABQ0
UNIPROT:P0A7E5
UNIPROT:B4EDA2
UNIPROT:P37869
UNIPROT:Q9KUJ3
UNIPROT:P0A6Y5
UNIPROT:P0A9W9
UNIPROT:Q9KVK6
UNIPROT:B3EWP1
UNIPROT:O31206
UNIPROT:P30855
UNIPROT:B3EWP1
UNIPROT:P97030
UNIPROT:P44492
UNIPROT:P63883
UNIPROT:O52380
UNIPROT:O52379
UNIPROT:P37552
UNIPROT:P72349
UNIPROT:P24137
UNIPROT:P24136
UNIPROT:P75799
UNIPROT:P75798
UNIPROT:P75797
UNIPROT:Q9HXE3
UNIPROT:P44103
UNIPROT:Q81SZ9
UNIPROT:O31158
UNIPROT:O32197
UNIPROT:O87940
UNIPROT:P0ACJ0
UNIPROT:Q7A2Q1
UNIPROT:O32197
UNIPROT:P75919
UNIPROT:P0ABP6
UNIPROT:Q5F8M6
UNIPROT:P09394
UNIPROT:P9WLC7
UNIPROT:P28246
UNIPROT:P16384
UNIPROT:P9WQK1
UNIPROT:Q8YS92
UNIPROT:P75797
UNIPROT:P75798
UNIPROT:P75799
UNIPROT:P75796
UNIPROT:K9NBS6
UNIPROT:Q9HWT6
UNIPROT:E9JFX9
UNIPROT:P77626
UNIPROT:P0ABN5
UNIPROT:Q9I5N9
UNIPROT:P0AFT5
UNIPROT:P0AD14
UNIPROT:Q9HVM1
UNIPROT:B4E9G3
UNIPROT:P9WFM5
UNIPROT:P69931
UNIPROT:P9WGJ1
UNIPROT:P0ABF1
UNIPROT:P43777
UNIPROT:P9WJX9
UNIPROT:P0A388
UNIPROT:P00106
UNIPROT:P46125
UNIPROT:P14788
UNIPROT:P0AF01
UNIPROT:P37329
UNIPROT:P13036
UNIPROT:O32147
UNIPROT:Q9HV61
UNIPROT:Q0B2F6
UNIPROT:Q9AIU0
UNIPROT:P52477
UNIPROT:P24181
UNIPROT:Q51487
UNIPROT:E1WGN1
UNIPROT:P42409
UNIPROT:P42410
UNIPROT:P42411
UNIPROT:P17906
UNIPROT:P23545
UNIPROT:Q9HU16
UNIPROT:Q21XD7
UNIPROT:Q5P5G4
UNIPROT:Q2SY18
UNIPROT:Q7WGU8
UNIPROT:P0AB60
UNIPROT:P0A6Y1
UNIPROT:Q9JZ44
UNIPROT:Q9KRB0
UNIPROT:Q04983
UNIPROT:P0A9J8
UNIPROT:P37411
UNIPROT:P13794
UNIPROT:P17993
UNIPROT:Q9HZ62
UNIPROT:P0AA43
UNIPROT:O34928
UNIPROT:P9WPE3
UNIPROT:Q46ZL2
UNIPROT:P0AFB5
UNIPROT:P41789
UNIPROT:P09030
UNIPROT:P0ADK4
UNIPROT:P0AFV0
UNIPROT:P77211
UNIPROT:P04174
UNIPROT:P0A886
UNIPROT:P9WIT1
UNIPROT:P0A8K8
UNIPROT:O07610
UNIPROT:P53554
UNIPROT:P27747
UNIPROT:Q8L0X4
UNIPROT:P12008
UNIPROT:Q47142
UNIPROT:P00632
UNIPROT:P39830
UNIPROT:P0A2D5
UNIPROT:P33195
UNIPROT:Q9WY55
UNIPROT:P64225
UNIPROT:Q9HU16
UNIPROT:Q9KR65
UNIPROT:A1WPV4
UNIPROT:Q9HZP5
UNIPROT:P50199
UNIPROT:P39314
UNIPROT:P02941
UNIPROT:P02942
UNIPROT:A0A0A1H8I4
UNIPROT:P27111
UNIPROT:P76250
UNIPROT:P77389
UNIPROT:P30863
UNIPROT:B3EWP1
UNIPROT:P22731
UNIPROT:P45073
UNIPROT:P0AEX7
UNIPROT:Q8GAL4
UNIPROT:P94679
UNIPROT:P33164
UNIPROT:P80877
UNIPROT:P05100
UNIPROT:Q83AY0
UNIPROT:P0AA41
UNIPROT:P00370
UNIPROT:Q7A3Q5
UNIPROT:P45742
UNIPROT:Q9AQI0
UNIPROT:B3EWP1
UNIPROT:P0DOV7
UNIPROT:P0ACP5
UNIPROT:Q6FFQ1
UNIPROT:B3EWP1
UNIPROT:O07007
UNIPROT:Q888H1
UNIPROT:P0DOV6
UNIPROT:Q128M1
UNIPROT:P37674
UNIPROT:O07838
UNIPROT:P44542
UNIPROT:P52697
UNIPROT:B3EWP1
UNIPROT:Q9F466
UNIPROT:Q8VTT5
UNIPROT:P0AFR9
UNIPROT:P0AFK4
UNIPROT:P43951
UNIPROT:P68187
UNIPROT:K9NBS6
UNIPROT:P0ACM2
UNIPROT:Q05115
UNIPROT:P31801
UNIPROT:P76268
UNIPROT:P69874
UNIPROT:P0AFK4
UNIPROT:P0AFR9
UNIPROT:O85226
UNIPROT:O85227
UNIPROT:P40875
UNIPROT:P43951
UNIPROT:Q5LLW6
UNIPROT:P69451
UNIPROT:P9WGQ7
UNIPROT:Q8YEE8
UNIPROT:P0AEX7
UNIPROT:P27675
UNIPROT:P22731
UNIPROT:P40676
UNIPROT:P0A0W2
UNIPROT:P37552
UNIPROT:Q5KWZ6
UNIPROT:Q9K0N4
UNIPROT:P08622
UNIPROT:Q79VI7
UNIPROT:P40138
UNIPROT:P0AEC3
UNIPROT:Q09049
UNIPROT:P26458
UNIPROT:O06179
UNIPROT:Q5LLW7
UNIPROT:P38974
UNIPROT:P38975
UNIPROT:Q8KR68
UNIPROT:Q9HU16
UNIPROT:P39285
UNIPROT:G4V4T7
UNIPROT:Q9HXM1
UNIPROT:P31473
UNIPROT:Q9HZX3
UNIPROT:P41052
UNIPROT:O07427
UNIPROT:P9WLA9
UNIPROT:Q9KLK7
UNIPROT:P54302
UNIPROT:Q9HX91
UNIPROT:Q8DI95
UNIPROT:B3EWP1
UNIPROT:P62723
UNIPROT:Q5LRT0
UNIPROT:P39398
UNIPROT:P28304
UNIPROT:B0VHH0
UNIPROT:P37610
UNIPROT:P9WI65
UNIPROT:P0AA53
UNIPROT:P0A9H3
UNIPROT:Q9K151
UNIPROT:P0AFU0
UNIPROT:P33915
UNIPROT:P33916
UNIPROT:P40711
UNIPROT:P0A8A8
UNIPROT:Q83BS0
UNIPROT:P0A705
UNIPROT:P65967
UNIPROT:P60340
UNIPROT:P32132
UNIPROT:Q0AVM1
UNIPROT:P0A832
UNIPROT:P0AGL5
UNIPROT:Q4QNE7
UNIPROT:P0ADG7
UNIPROT:B4RJH7
UNIPROT:P21341
UNIPROT:P22643
UNIPROT:P76008
UNIPROT:P40138
UNIPROT:P0AFQ7
UNIPROT:P06710
UNIPROT:Q2SWM4
UNIPROT:P28306
UNIPROT:P0ADE8
UNIPROT:Q5HH30
UNIPROT:P9WK87
UNIPROT:Q59477
UNIPROT:P46837
UNIPROT:P9WJW5
UNIPROT:P9WGP9
UNIPROT:P33224
UNIPROT:P76113
UNIPROT:P50199
UNIPROT:Q5LLW7
UNIPROT:O32177
UNIPROT:P0AED7
UNIPROT:P28793
UNIPROT:P22731
UNIPROT:P45073
UNIPROT:P0AEX7
UNIPROT:P21175
UNIPROT:Q5LRT0
UNIPROT:A7N6S2
UNIPROT:Q9KM66
UNIPROT:Q3JRF0
UNIPROT:P0A710
UNIPROT:P0ABE2
UNIPROT:P40415
UNIPROT:Q9K1I8
UNIPROT:P02359
UNIPROT:Q6N4T2
UNIPROT:P08506
UNIPROT:P07324
UNIPROT:Q59643
UNIPROT:Q9WZ31
UNIPROT:P45875
UNIPROT:P0A115
UNIPROT:Q51948
UNIPROT:P52037
UNIPROT:P0AEY1
UNIPROT:P9WMH2
UNIPROT:P45031
UNIPROT:P64606
UNIPROT:P76082
UNIPROT:P23694
UNIPROT:Q9I702
UNIPROT:Q00593
UNIPROT:P69680
UNIPROT:P0AD30
UNIPROT:Q8KG93
UNIPROT:P0AG44
UNIPROT:P0A0Y1
UNIPROT:P0A7V8
UNIPROT:P0A7R9
UNIPROT:P0A7S9
UNIPROT:P0A7Q6
UNIPROT:Q2SU48
UNIPROT:P0AGA2
UNIPROT:P02413
UNIPROT:Q9HWF3
UNIPROT:P0A7W1
UNIPROT:P46899
UNIPROT:Q9K1I3
UNIPROT:P0A7W7
UNIPROT:P0AG59
UNIPROT:P62399
UNIPROT:P60624
UNIPROT:Q7DDT2
UNIPROT:P9WN79
UNIPROT:P22008
UNIPROT:P0AGK1
UNIPROT:P76536
UNIPROT:P0ACQ4
UNIPROT:P24230
UNIPROT:P0A7F9
UNIPROT:P0A847
UNIPROT:O07584
UNIPROT:P52600
UNIPROT:Q47147
UNIPROT:P04425
UNIPROT:P76103
UNIPROT:P22731
UNIPROT:P0AEX7
UNIPROT:Q8YEE8
UNIPROT:B5ZA76
UNIPROT:P0ACM2
UNIPROT:Q2SZ88
UNIPROT:P28630
UNIPROT:Q9JXT2
UNIPROT:P0ABV2
UNIPROT:P0ABU9
UNIPROT:Q2SZ94
UNIPROT:P0A937
UNIPROT:Q03456
UNIPROT:Q9RE09
UNIPROT:P69829
UNIPROT:Q9RVE7
UNIPROT:P0ABI4
UNIPROT:A6T4W3
UNIPROT:P23847
UNIPROT:P77622
UNIPROT:P0AAG0
UNIPROT:P0AEG1
UNIPROT:P0AEF8
UNIPROT:P69451
UNIPROT:P0AB67
UNIPROT:P07001
UNIPROT:O84467
UNIPROT:P0ABV2
UNIPROT:P0ABV2
UNIPROT:P0ABU9
UNIPROT:A1AA28
UNIPROT:B3EWP1
UNIPROT:B3EWP1
UNIPROT:O67610
UNIPROT:Q1QWN6
UNIPROT:B3EWP1
UNIPROT:Q8GHB1
UNIPROT:Q9Z4P0
UNIPROT:P07774
UNIPROT:P25745
UNIPROT:P37902
UNIPROT:P39270
UNIPROT:Q83EP4
UNIPROT:P77726
UNIPROT:P0A698
UNIPROT:P9WIU3
UNIPROT:P0ABT8
UNIPROT:P07268
UNIPROT:P25894
UNIPROT:P0AE16
UNIPROT:P23971
UNIPROT:P0AA16
UNIPROT:P18392
UNIPROT:Q9X5X4
UNIPROT:P37692
UNIPROT:O86428
UNIPROT:P24554
UNIPROT:O34777
UNIPROT:P37352
UNIPROT:P37352
UNIPROT:Q9HTJ1
UNIPROT:Q05353
UNIPROT:Q05354
UNIPROT:P42270
UNIPROT:O05151
UNIPROT:B3EWP1
UNIPROT:Q9L9F0
UNIPROT:P23895
UNIPROT:P29012
UNIPROT:P39196
UNIPROT:P74250
UNIPROT:P0AE39
UNIPROT:O07020
UNIPROT:P9WIT1
UNIPROT:P0DOV6
UNIPROT:P76584
UNIPROT:P69680
UNIPROT:P0AC55
UNIPROT:P45049
UNIPROT:P31833
UNIPROT:Q92RG6
UNIPROT:P06030
UNIPROT:P12946
UNIPROT:Q3J5F9
UNIPROT:P54178
UNIPROT:B3EWP1
UNIPROT:P0AAP1
UNIPROT:P0AGB3
UNIPROT:Q7SIA8
UNIPROT:P14929
UNIPROT:P9WHT5
UNIPROT:P9WHT5
UNIPROT:P0ADX9
UNIPROT:Q3JW91
UNIPROT:P00632
UNIPROT:D3DFP8
UNIPROT:K9NBS6
UNIPROT:P23847
UNIPROT:P9WQJ5
UNIPROT:P76027
UNIPROT:P75799
UNIPROT:P75798
UNIPROT:P00208
UNIPROT:Q2T1B9
UNIPROT:Q02G03
UNIPROT:Q63XL8
UNIPROT:Q8FI04
UNIPROT:P05523
UNIPROT:P17802
UNIPROT:P0A894
UNIPROT:P05824
UNIPROT:P65774
UNIPROT:P30727
UNIPROT:P30128
UNIPROT:P0AG24
UNIPROT:Q02E25
UNIPROT:P60546
UNIPROT:P44726
UNIPROT:A6QGC0
UNIPROT:Q8Y678
UNIPROT:P50597
UNIPROT:P52061
UNIPROT:P54304
UNIPROT:J7QLC0
UNIPROT:Q03287
UNIPROT:P17091
UNIPROT:P18317
UNIPROT:P17086
UNIPROT:Q2YPD6
UNIPROT:Q03282
UNIPROT:P54302
UNIPROT:Q9HV27
UNIPROT:Q9JXZ2
UNIPROT:Q5ZRP2
UNIPROT:Q5SK28
UNIPROT:P9WI85
UNIPROT:A0A0H2ZDT7
UNIPROT:P42619
UNIPROT:P46852
UNIPROT:P67661
UNIPROT:Q888A4
UNIPROT:P27305
UNIPROT:P44648
UNIPROT:P00957
UNIPROT:Q44492
UNIPROT:Q04441
UNIPROT:Q04441
UNIPROT:Q04441
UNIPROT:Q04440
UNIPROT:O84741
UNIPROT:B8H427
UNIPROT:B8H428
UNIPROT:P64578
UNIPROT:Q7A4T3
UNIPROT:P9WJP7
UNIPROT:P77202
UNIPROT:P9WPW3
UNIPROT:P42315
UNIPROT:O87940
UNIPROT:P0A964
UNIPROT:P02941
UNIPROT:P02942
UNIPROT:P02942
UNIPROT:O68146
UNIPROT:P0A2R4
UNIPROT:P20958
UNIPROT:Q06530
UNIPROT:P36929
UNIPROT:O34206
UNIPROT:P06628
UNIPROT:P02975
UNIPROT:P45759
UNIPROT:Q9HTQ0
UNIPROT:Q5P5G5
UNIPROT:B3EWP1
UNIPROT:Q51645
UNIPROT:P37552
UNIPROT:P09546
UNIPROT:P0ACJ0
UNIPROT:P0A6F3
UNIPROT:P44506
UNIPROT:P24559
UNIPROT:P9WMH2
UNIPROT:P24559
UNIPROT:P77161
UNIPROT:Q5F5E3
UNIPROT:P67087
UNIPROT:P17888
UNIPROT:Q2STF3
UNIPROT:Q93CH6
UNIPROT:P75849
UNIPROT:Q55168
UNIPROT:P26647
UNIPROT:Q7WG29
UNIPROT:P00961
UNIPROT:P00960
UNIPROT:Q8ZKL0
UNIPROT:P9WJH9
UNIPROT:P23930
UNIPROT:P0AE78
UNIPROT:P0ACJ5
UNIPROT:P80064
UNIPROT:P30967
UNIPROT:P40138
UNIPROT:A9YWT8
UNIPROT:P0ACJ8
UNIPROT:P0C0F6
UNIPROT:P40138
UNIPROT:P37957
UNIPROT:P9WMV7
UNIPROT:P69451
UNIPROT:Q51698
UNIPROT:Q51697
UNIPROT:P43672
UNIPROT:P75919
UNIPROT:Q72LL6
UNIPROT:P38101
UNIPROT:Q72LL6
UNIPROT:P0ACA7
UNIPROT:Q51792
UNIPROT:P0AG34
UNIPROT:A4F2N8
UNIPROT:A6QGB8
UNIPROT:Q55367
UNIPROT:Q55108
UNIPROT:Q55461
UNIPROT:P38043
UNIPROT:P0AED5
UNIPROT:O32198
UNIPROT:P83772
UNIPROT:O67323
UNIPROT:P36655
UNIPROT:P35160
UNIPROT:P0A8F8
UNIPROT:P0AGK8
UNIPROT:P0A6B9
UNIPROT:P0ACD6
UNIPROT:P0AAC8
UNIPROT:Q9KTX9
UNIPROT:P0A6Z1
UNIPROT:P0A9R4
UNIPROT:Q0QLE4
UNIPROT:Q9Z4P0
UNIPROT:P77671
UNIPROT:Q0QLE1
UNIPROT:Q726X4
UNIPROT:B3EWP1
UNIPROT:P22731
UNIPROT:P45073
UNIPROT:P0AEX7
UNIPROT:P21175
UNIPROT:Q9K492
UNIPROT:P50199
UNIPROT:P03007
UNIPROT:P37339
UNIPROT:P0A7E1
UNIPROT:P0A7Z0
UNIPROT:P44758
UNIPROT:P27237
UNIPROT:Q0PA35
UNIPROT:Q8KR08
UNIPROT:P23222
UNIPROT:Q9ZHC7
UNIPROT:P12374
UNIPROT:P77214
UNIPROT:P77239
UNIPROT:P38054
UNIPROT:P0ACS5
UNIPROT:P20021
UNIPROT:P39595
UNIPROT:P13510
UNIPROT:P13511
UNIPROT:P21179
UNIPROT:P0AG07
UNIPROT:Q7VU61
UNIPROT:Q8ZR41
UNIPROT:P08692
UNIPROT:P55881
UNIPROT:P76250
UNIPROT:P07018
UNIPROT:P32662
UNIPROT:P20580
UNIPROT:P00901
UNIPROT:P75823
UNIPROT:P0AG34
UNIPROT:Q8PD71
UNIPROT:Q2YRR4
UNIPROT:Q83CW4
UNIPROT:P21179
UNIPROT:P76250
UNIPROT:P18200
UNIPROT:P0A6G3
UNIPROT:P9WL06
UNIPROT:P77333
UNIPROT:P39812
UNIPROT:P73574
UNIPROT:Q9K0V4
UNIPROT:P0A171
UNIPROT:P45073
UNIPROT:P45074
UNIPROT:Q7W2Q0
UNIPROT:P29131
UNIPROT:Q99W05
UNIPROT:P76250
UNIPROT:A9X6P9
UNIPROT:Q06903
UNIPROT:Q8GP20
UNIPROT:Q8GP19
UNIPROT:Q1MW86
UNIPROT:P0AFG9
UNIPROT:P06959
UNIPROT:P0A9P0
UNIPROT:P05431
UNIPROT:Q9K4M4
UNIPROT:Q51698
UNIPROT:Q51697
UNIPROT:P76190
UNIPROT:P75804
UNIPROT:P32967
UNIPROT:Q7WZY5
UNIPROT:P21367
UNIPROT:Q89WV5
UNIPROT:P0A0P0
UNIPROT:P45049
UNIPROT:P9WL95
UNIPROT:P37642
UNIPROT:P13009
UNIPROT:P9WMH2
UNIPROT:B3EWP1
UNIPROT:P67661
UNIPROT:P0ACJ0
UNIPROT:P13009
UNIPROT:P15043
UNIPROT:Q46WL3
UNIPROT:Q606Q2
UNIPROT:Q4FQF7
UNIPROT:P33038
UNIPROT:P0A9W6
UNIPROT:P0AFN6
UNIPROT:P36879
UNIPROT:P0ADV7
UNIPROT:P43262
UNIPROT:P64604
UNIPROT:P64606
UNIPROT:P45031
UNIPROT:P9WN19
UNIPROT:P55037
UNIPROT:P60390
UNIPROT:P0AEN4
UNIPROT:P08149
UNIPROT:P22188
UNIPROT:P11880
UNIPROT:P0A6W3
UNIPROT:Q9HVZ9
UNIPROT:P0ABG4
UNIPROT:Q9HW01
UNIPROT:Q9HW02
UNIPROT:P07862
UNIPROT:P06136
UNIPROT:P37330
UNIPROT:P9WIT1
UNIPROT:Q9RSH2
UNIPROT:P0A8C8
UNIPROT:Q1R4M9
UNIPROT:P25522
UNIPROT:P29281
UNIPROT:L0TCG5
UNIPROT:Q7WF17
UNIPROT:P23136
UNIPROT:P0A742
UNIPROT:Q6D0N8
UNIPROT:Q57354
UNIPROT:P39099
UNIPROT:P69423
UNIPROT:P69428
UNIPROT:P44956
UNIPROT:Q7P0E6
UNIPROT:P9WMM7
UNIPROT:P9WMM3
UNIPROT:Q8ZPV9
UNIPROT:A1R562
UNIPROT:P61779
UNIPROT:P64373
UNIPROT:P07658
UNIPROT:Q54001
UNIPROT:Q8ZNN5
UNIPROT:Q9AGJ6
UNIPROT:Q74FW6

[DO:] Go to the the website linked above, and past in your IDs in the search field. You can now browse through the pathways encoded by those genes.image.png

Visualising in KEGG

Visualising in KEGG ( the Kyoto Encyclopedia of Genes and Genomes) allows us to zoom in further into the metabolism. The best way to approach this, is to take all protein sequences produced by prokka (the .faa file) and do a protein blast to the KEGG database with blastKOala. This takes quite some time and requires an account, so I did this for you already. The files are available in the data/blastKOala directory. Note that the bin numbers may not match up between my example and your own run, this is just for illustration of what's possible with the method.

[DO:] Upload and visualise these simple tables on KEGG reconstruct pathway

You may feel overwhelmed by the number of pathways, modules and genes available to you. For this specific case, we are interested in Nitrogen metabolism. You may have a look at the Dijkhuizen et al. 2018 paper on Azolla endophytes. Figure 4 of that paper shows the nitrogen metabolism of multiple microbes is plotted. Next, some hypotheses are derived and tested in the wet lab. Does your plot of the nitrogen metabolism overlap with the one published? Or did you maybe discover a new endophyte!

http://www.genome.jp/kegg/tool/map_module.html

In [7]:
ls data/blastKOala -1
bin_1_KO.txt
bin_2_KO.txt
bin_3_KO.txt
bin_4_KO.txt
bin_5_KO.txt
bin_6_KO.txt
KO_combined.txt

A snapshot of the KEGG nitrogen metabolism pathway looks like this (with the KO_combined.txt file as input).

image.pngYou may feel overwhelmed by the number of pathways, modules and genes available to you. For this specific case, we are interested in Nitrogen metabolism. You may have a look at the Dijkhuizen et al. 2018 paper on Azolla endophytes. Find of that paper, this shows the nitrogen metabolism of multiple microbes. Next, some hypotheses are derived and tested in the wet lab. Does your plot of the nitrogen metabolism overlap with the one published? Or did you maybe discover a new endophyte!