Frequently Asked Questions

How Can We Help You?

Find answers to the most common questions about novelBGC.

Popular Topics:
Getting Started Installation BGC Analysis antiSMASH File Formats
General

Basic information about novelBGC and how to get started

Technical

Installation, configuration, and technical requirements

Analysis

Understanding BGC analysis workflows and results

Tools

Information about integrated tools and methods

Data & Files

File formats, uploads, and data management

GENERAL

General Questions

novelBGC is a comprehensive bioinformatics platform designed for the discovery, analysis, and characterization of novel Biosynthetic Gene Clusters (BGCs). It integrates multiple tools including antiSMASH, BiG-SLICE, and custom algorithms to help researchers identify and study secondary metabolite biosynthetic pathways in microbial genomes.

novelBGC is designed for researchers, bioinformaticians, and students working in the fields of microbiology, natural products, and drug discovery. The platform is accessible to users with varying levels of bioinformatics expertise, from beginners to advanced researchers.

Yes, novelBGC is free for academic and non-commercial research purposes. We offer different tiers of service including a free tier with standard features and premium tiers for advanced features and higher computational resources. Please check our pricing page for detailed information.

If you use novelBGC in your research, please cite our publication: [upcoming]. Additionally, please cite the individual tools used in your analysis (e.g., antiSMASH, BiG-SLICE) as they are integral components of our pipeline.
TECHNICAL

Technical Questions

novelBGC is a web application and cannot be installed locally. To use robust pipelines and install similar BGC analysis workflows locally, you may refer to bgcflow, which is designed for local installation and advanced analysis.

novelBGC is built using:
Backend: Python (Flask), Nextflow for workflow management
Frontend: HTML5, CSS3, JavaScript, Bootstrap
Database: PostgreSQL
Task Queue: Celery with Redis

novelBGC is tested and supported on all major desktop browsers, including Google Chrome, Mozilla Firefox, and Microsoft Edge. Please note that while the site is accessible on mobile browsers, the user experience is not fully optimized for mobile devices and certain features may not display or function correctly.
ANALYSIS

Analysis Questions

novelBGC provides the following types of analysis:
• Assessment of input assembled genome quality
• BGC identification and annotation using antiSMASH
• Analysis of each BGC for similarity to known MIBiG gene clusters
• Calculation of distance to the closest gene cluster family (via clustering)
• Detection of antimicrobial resistance (AMR) genes in BGCs based on the CARD database

Analysis time varies depending on:
• Genome size (typically 5-15 minutes for bacterial genomes)
• Server load

You'll receive an email notification when your analysis is complete, and you can monitor progress in the Job Status page.

The novelty score is calculated based on multiple factors:
• Sequence similarity to known BGCs in MIBiG database
• Gene cluster family membership
• Distance to the closest gene cluster family
• Gene cluster family confidence (based on MIBiG presence in GCF)
• Antimicrobial resistance (AMR) gene presence
• Contig edge proximity

The novelty score is calculated using a custom algorithm that combines these factors into a single score. Scores range from 0 (known BGC) to 1 (completely novel).

The results page in novelBGC is divided into four main sections to help you interpret your analysis:
  • Summary: Overview of your submission, including job details and key findings.
  • Genome Statistics: Information on the quality and features of your input genome, such as total length, N50, and other relevant metrics.
  • BGC Distribution: Visualization and summary of all identified Biosynthetic Gene Clusters (BGCs) in your genome, including their types and locations.
  • BGC Prioritization: A ranked list of BGCs prioritized based on their novelty scores, providing detailed information for each cluster to help you focus on the most potentially novel or interesting BGCs.
DATA & FILES

Data & Files Questions

novelBGC supports the following input formats:
Genome sequences: FASTA (.fasta, .fa, .fna)
Maximum file size: 100MB

To upload files:
1. Navigate to the Upload page
2. Click "Choose Files" or drag-and-drop your files
3. Give a Genus, Species, and Strain name for your genome
4. Email address (optional) to receive your job ID via email for easy access to results later
5. To tweak weights and thresholds, you can use the Advanced Settings section (Optional)
6. Click "Start Analysis"

Results are stored for 30 days. You can download your results at any time during the storage period. We recommend downloading important results promptly.

Yes! You can download:
• Complete result packages (ZIP archive)
• Individual analysis files
• Interactive HTML reports
• Raw data in various formats (JSON, CSV, TSV)
• Publication-ready figure of the BGC distance to closest GCF distribution

All downloads are available from the Job Status or Results page.
TOOLS

Integrated Tools Questions

novelBGC currently uses antiSMASH 6.1.1. We are working to update to the newest versions to ensure you have access to the latest features and improvements. You can check the exact version in use on the About page.

Yes! novelBGC allows you to customize various parameters:
• antiSMASH detection strictness
• Distance to closest GCF thresholds
• Weights for the N/RS score

Advanced users can access these options in the "Advanced Settings" section during upload.

novelBGC integrates several powerful tools:
antiSMASH: BGC identification
BiG-SLICE: BGC family clustering
QUAST: Genome quality assessment
Prokka: Genome annotation
Custom algorithms: Novelty scoring, distance calculation
CARD RGI: Antimicrobial resistance (AMR) gene detection
MIBiG: BGC similarity search against the MIBiG database
BIGFAM: BGC family clustering
And more tools are being added regularly!

Still have questions?

Can't find the answer you're looking for? Our support team is here to help!

Contact Support

novelBGC is free to use for everybody to use, even commercially. See our license for details.