Frequently asked questions

Which genomes are searched?

Tool 1 uses the tool_1.nf workflow to query the selected genomes in the input_tool1 folder.

Additional, the tool_1_EVERYTHING.nf workflow can be used to search ALL the genomes in the LIBRARY folder.

Which genomes are in the LIBRARY folder?

All genomes assembled by the Core appliance can be found in the Tool 1 LIBRARY folder. This is updated manually by running populate_inputs.sh. Instructions on how to run populate_inputs.sh can be found here. The files are hard linked, so they are not duplicated on the server.

Additionally, reference genomes can be added to this folder.

Which databases are available?

The available databases are:

Database	command	Database contents
ARG-ANNOT	`argannot`	Antibiotic resistance genes
CARD	`card`	The Comprehensive Antibiotic Resistance Database
EcOH	`ecoh`	Serotyping of E. coli
ecoli_vf	`ecoli_vf`	Escherichia coli virulence factors. Included VFDB and more E.coli specific genes
NCBI BARRGD	`ncbi`	NCBI Bacterial Antimicrobial Resistance Reference Gene Database
PlasmidFinder	`plasmidfinder`	In silico detection and typing of plasmids
ResFinder	`resfinder`	ResFinder identifies acquired antimicrobial resistance genes
VFDB	`vfdb`	Virulence factor database

What are the different folders for?

The Tool 1 file system contains the following folders:

Directory name	Description of contents
LIBRARY	The sequence files (`.fa`) for all sequenced isolates
input_tool1	All assemblies (`*.fa`) in this directory will be tested by tool_1.
output_tool1	Your output data will be deposited here in subdirectories named `xxx_i(yy)_c(zz)_YYYY-MM-DD_HH:MM`, where `xxx` is the database, `yy` is the identity and `zz` is the coverage. This is followed by the date and time the run was started.
work	The Nextflow working directory.

How do you run Tool 1?

Running Tool 1 is straight-forward.

Briefly, once you have accessed Tool 1 and placed the assemblies you wish to test into the input folder all you need to do is enter a single command and the results will be deposited in the output folder.

What documents are made by Tool 1?

In the run output folder, Tool 1 deposits the documents in 2 sub-folders: isolate and summary.

In the isolate folder there will be a results file for each strain. In the summary folder there will be a single document summarising all ‘hits’ results.

Additionally, in the run output folder itself you will find run_parameters.txt. This text document lists:

the database used (`xxx`)
the number of sequences in the database,
the date the database was downloaded,
the version number of the software (abricate),
the minimum DNA %identity (`yy`) and
the minimum DNA %coverage (`zz`).

Ignore help=false. This is an expected byproduct of the help/usage message.

How do you view the output files?

You can open the files in Excel. The output files are in .tab format.

The run_parameters.txt document can be opened in any text editor.

Can I change the settings of Tool 1?

Yes, you can modify the two different settings: identity and coverage. Using the default settings there must be a minimum DNA identity of 80%, and a minimum DNA coverage of 80% for a match to be recorded. These can be altered when you set Tool 1 to run.

For all runs, the settings are recorded in the name of the newly created output folder, e.g. the results of a run on May 21st 2021 started at 1.30 pm using the resfinder database and default settings will be stored in this folder: resfinder-i80-c80__2021-05-21_13:15/.

Where can I find further information about Tool 1?

Tool 1 uses the ABRicate software tool. It was developed by Torsten Seemann and Björn Grüning in Melborne, Australia. The ABRicate information page can be found here on GitHub.