This Protocol is listed in the following Categories:
Computational and theoretical biology, Genomics and proteomics

Author(s): Marie-Jose Cros, Erika Sallet, Annick Moisan, Christine Cierco-Ayrolles and Christine Gaspin
Lab/Group: UR875, BIA
DOI: 10.1038/nprot.2007.285

Visualizing and exploring genomic information for non-protein-coding RNA identification using ApolloRNA

Marie-Josée Cros, Marie-Josee.Cros@toulouse.inra.fr, INRA, France

Erika Sallet, Erika.Sallet@toulouse.inra.fr, INRA, France

Annick Moisan, Annick.Moisan@toulouse.inra.fr, INRA, France

Christine Cierco-Ayrolles, Christine.Cierco-Ayrolles@toulouse.inra.fr, INRA, France

Christine Gaspin, Christine.Gaspin@toulouse.inra.fr, INRA, France

Lab/Group: UR875, BIA

Introduction

This protocol describes installation and use of the ApolloRNA software to visualize and explore various genomic information for non-protein-coding RNA (ncRNA) identification in prokaryote. The first part details how to install Apollo1 and the extension ApolloRNA on a computer with Windows, Linux or MAC OS X. The second part describes how to collect and visualize annotations and some information of an organism. This protocol allows to bring together information on an organism to visualize and analyze them locally on a computer in order to define and select valuable ncRNA candidates to be experimentally tested.

Materials

Reagents

Equipment

A modern personal computer with Windows (Microsoft), Linux (e.g. debian or Red Hat) or MacOS X (Apple) operating system is required as well as an Internet connection and any graphical user interface-based web-browser.

Time Taken

The first part can be performed in less than 1h. The second part can be performed in 30mn.

Procedure

Part 1. Apollo and ApolloRNA installation. TIMING 35 min (Windows), 40 min (Linux) or 45 min (MAC)
1| Download and install Apollo using option A, B or C depending on the operating system.
(A) For Windows
(i) In the directory 'My Documents', create the subdirectory 'Applications'.
(ii) In the directory 'Applications', create the subdirectory 'Apollo'.
(iii) Access the download Apollo page at http://www.fruitfly.org/annot/apollo/install.html.
(iv) In the line 'Windows', in the column 'includes Java VM', right-click the link 'Download'. In the menu, click 'Save link as...' and save the file in the directory 'Apollo'.
(v) Double-click the downloaded file 'install.exe' and follow instructions choosing the 'Apollo' directory as installation directory.
(B) For Linux
(i) Create the directory '~/Applications'.
(ii) In the directory 'Applications', create the subdirectory 'Apollo'.
(iii) Access the download Apollo page at http://www.fruitfly.org/annot/apollo/install.html.
(iv) In the line 'Linux', in the column 'includes Java VM', right-click the link 'Download'. In the menu, click 'Save link as...' and save the file in the directory 'Apollo'.
(v) Open a terminal window and move (with the command 'cd') to the directory 'Apollo'. Type: sh ./apolloinst.bin and follow instructions choosing 'Apollo' for installation directory.
(vi) Configure Apollo for Java. Open a terminal window, move to the directory 'Apollo' and type: pwd. The displayed line is the Apollo path, for instance '/home/dupont/Applications/Apollo'. In the subdirectory 'conf' of the directory 'Apollo', open the file 'apollo.cfg'. Add a line defining the JavaPath variable as the Apollo path extended with '/jre', for instance add the line:
JavaPath "/home/dupont/Applications/Apollo/jre"
(C) For Mac
(i) In your user directory, if the directory 'Applications' doesn't exist, create it.
(ii) In the directory 'Applications', create the subdirectory 'Apollo'.
(iii) Access the download Apollo page at http://www.fruitfly.org/annot/apollo/install.html.
(iv) In the line 'Mac OS X', in the column 'without Java VM', right-click the link 'Download'. In the menu, click 'Save link as...' and save the file in the directory 'Apollo'.
(v) Double-click the downloaded file 'install', and follow instructions choosing the 'Apollo' directory as installation directory.
(vi) To be sure Java 1.4 is on your computer, open a terminal window and type: java -version.
If the displayed message starts with 'java version 1.4', go directly to the step 2.
(vii) Access the download Java page at http://www.apple.com/support/downloads/javaupdate142.html. Click the arrow on the Download line and save the file 'Java142Update2.dmg' in your directory 'Applications'. Double-click 'Java142Update2.dmg' and follow instructions.
(viii) Configure Apollo for Java. Open a terminal window and type: which java. The displayed line is the java executable location, for instance '/usr/bin/java'. In the subdirectory 'conf' of the directory 'Apollo', open the file 'apollo.cfg'. Add a line defining the 'JavaPath' variable to the path before 'bin/java', for example:
JavaPath "/usr/"

2| Download and install RNAfold (predict minimum energy secondary structures and pair probabilities) and RNAeval (evaluate energy of RNA secondary structures) of the Vienna RNA package2 (programs for RNA Secondary Structure prediction and comparison) using option A or B depending on the operating system.
(A) For Windows
(i) Access the download ViennaRNA package page at http://www.tbi.univie.ac.at/~ivo/RNA/windoze/.
(ii) Below 'Download', right-click the 'RNAfold.exe' link and save the file in the directory 'Apollo'.
(iii) Repeat the step (ii) to save the file 'RNAeval.exe'.
(iv) In the taskbar, click 'Start' then chose 'Control Panel'. In the new window, double click on 'System' or if it does not exist on 'Performance and Maintenance' then on 'System'. In the 'System properties' window, click the 'Advanced' tab, click the 'Environment Variables' button to open the environment variables window (Figure 1a).
(v) If the user variable 'Path' exists, select the line of the 'Path' variable, click the 'Edit...' button and add ';' at the end of the value followed by the path of the 'Apollo' directory. For instance, if the path of the 'Apollo' directory is 'C:\dupont\Apollo', add text: ;C:\dupont\Apollo.
Else, if the user variable 'Path' does not exist, click the 'New...' button. A new window (Figure 1b) appears that allows to define a new variable named 'Path' with the path of the 'Apollo' directory for value. For instance, if the path of the 'Apollo' directory is 'C:\dupont\Apollo', the value to be typed is: C:\dupont\Apollo.
(B) For Linux or MAC. CRITICAL STEP with administrator privileges.
(i) Access the Vienna RNA package download page at http://www.tbi.univie.ac.at/~ivo/RNA/. Click 'source distribution here' link and save the 'ViennaRNA.gz' file in the directory 'Applications'.
(ii) Open a terminal window and move (with the command 'cd') to the directory 'Applications'. To install the Vienna RNA package, type:
tar xzvf ViennaRNA-1.6.2.tar.gz
cd ViennaRNA-1.6.2
./configure
make
make install

3| Download and install ImageMagick (a software suite to create, edit, and compose bitmap images) using option A or B depending on the operating system. CRITICAL STEP with administrator privileges.
(A) For Windows
(i) Access the download ImageMagick page at http://www.imagemagick.org/script/binary-releases.php#windows.
(ii) In the first line, in the 'HTTP' column, right-click the link 'download'. In the menu, click 'Save link as...' and save the file in the directory 'Applications'.
(iii) Double-click the downloaded file 'ImageMagick-6.3.3-10-Q16-windows-dll.exe' and follow instructions.
(B) For Linux or MAC
(i) Access the download ImageMagick page at http://www.imagemagick.org/script/install-source.php.
(ii) Right-click the link 'ImageMagick.tar.gz'. In the menu, click 'Save link as...' and save the file in the directory 'Applications'.
(iii) Open a terminal window and move to the directory 'Applications'. To install ImageMagick, type:
tar xzvf ImageMagick.tar.gz
cd ImageMagick-6.3.1
./configure
make
make install

4| Download and install GhostScript (an interpreter for the PostScript language and for PDF) using option A or B depending on the operating system. CRITICAL STEP with administrator privileges.
(A) For Windows
(i) In the taskbar, click 'Start' then choose 'Control Panel'. In the new window, click 'Add or Remove Programs'. Look for the program 'Ghostscript 8.56' in the list of programs. If the program belongs to the list, go directly to the step 5.
(ii) Access the download GhostScript page at http://www.ghostscript.com/awki. Click the link 'sourceforge.net'.
(iii) Right-click the link 'gs856w32.exe'. In the menu, click 'Save link as...' and save the file in the directory 'Applications'.
(iv) Double-click the downloaded file 'gs856w32.exe' and follow the instructions.
(B) For Linux or MAC
(i) Open a terminal window and type: gs -h. If the message 'Command not found' is not displayed, go directly to the step 5.
(ii) Access the download GhostScript page at http://www.ghostscript.com/awki. Click the link 'sourceforge.net'.
(iii) Right-click the link 'ghostscript-8.56.tar.gz'. In the menu, click 'Save link as...' and save the file in the directory 'Applications'.
(iv) Open a terminal window and move to the directory 'Applications'. To install GhostScript, type:
tar xzvf ghostscript-8.56.tar.gz
cd ghostscript-8.56
./configure
make
make install

5| Download and install ApolloRNA
(i) In the directory 'Apollo', rename 'apollo.jar' to 'apollo.jar.save'.
(ii) Access the ApolloRNA project page at http://mulcyber.toulouse.inra.fr/frs/?group_id=47. Right-click the link 'apollo.jar'. In the menu, click 'Save link as...' and save the file in the directory 'Apollo'.

[Alert! See author comment below. The correct link is: http://mulcyber.toulouse.inra.fr/gf/project/apollorna/frs/]

Part 2. Load and visualize data about Pyrococcus abyssi GE5 genome. TIMING 30 min
6| Download Pyrococcus abyssi GE5 GenBank annotations in GAME XML format
(i) Access the ftp NCBI page at ftp://ftp.ncbi.nih.gov/genomes/Bacteria/Pyrococcus_abyssi. Click the link 'NC_000868.gbk' and copy the whole text.
(ii) Access to the ApolloRNA convert data page at http://carlit.toulouse.inra.fr/ApolloRNA/convert.html and paste the data in the textarea. In the 'Input format' list, select 'Genbank' and in the 'Output format' list, select 'Game XML'. Click the 'Convert' button.
(iii) Copy the resulting text and paste it in a new file called 'NC_000868.xml'.

7| Download predicted transcription terminator in GAME XML format
(i) Access to the TransTermHP3 page at http://transterm.cbcb.umd.edu/cgi-bin/transterm/predictions.pl and click 'Pyrococcus abyssi GE5'. Copy all the displaying text, except the last part (terminators of the plasmid) starting by the line:
SEQUENCE gi|10954552|ref|NC_001773.1| Pyrococcus abyssi GE5 plasmid pGT5, complete sequence
(ii) Access to the ApolloRNA convert data page and paste the data in the textarea. In the 'Input format' list, select 'TransTermHP', and in the 'Output format' list, select 'Game XML'. In the 'Related genomic sequence coordinates' fields, enter 1 for the beginning and 1765118 for the end. Click the 'Convert' button.
(iii) Copy the resulting text and paste it in a new file called 'NC_000868_TransTermHP.xml'.

8| Visualizing both information
(i) Launch Apollo using option A or B depending on the operating system.
(A) For Windows: In the directory 'Apollo', click the Apollo icon.
(B) For Linux or MAC: Open a terminal window, move to the directory 'Apollo' and type: bin/Apollo.
The Apollo load data window (Figure 1c) appears.
(ii) In the list 'Choose data source', choose 'GAME XML format'. Click the 'Browse...' button: a window appears. Select the file 'NC_000868.xml' and click the 'Open' button. In the Apollo load data window, click the 'Ok' button. When Apollo has loaded data, the Apollo main window appears.
(iii) Open the 'File' menu and click the 'Layer more results or annotations...' item. In the 'Choose data source' list, choose 'GAME XML format'. Click the 'Browse...' button. Select the file 'NC_000868_TransTermHP.xml' and click the 'Open' button. In the Apollo load data window, click the 'Ok' button.

9| Visualizing a predicted secondary structure, example for a tRNA
(i) Click twice on the 'X10' zoom button. The main Apollo window displays both GenBank annotations (in the blue parts) and TransTermHP data (in the black parts) of Pyrococcus abyssi genome. Click the inorganic polyphosphate/ATP-NAD kinase protein annotation to display (Figure 1d) more information about the annotation.
(ii) Open the 'Edit' menu and click the 'Find...' item. A new 'Find' window opens. In the field 'Name', type: PABt04. Click the 'Find' button on the line. The Apollo main view zooms on the tRNA PABt04. Click the 'X.1' zoom button to change scale.
(iii) Open the 'RNA' menu and click the 'Secondary structure compute...' item. A new window (Figure 2a) opens that displays the predicted secondary structure of the tRNA and the associated free energy. Click the 'Close' button.

10| Visualizing possible RNA/RNA interactions, example for the interaction between a sno C/D RNA and the 16S rRNA
(i) In the opened 'Find' window, in the field 'Name', type now: PABsnRNA29. Click the 'Find' button on the line. The Apollo main view zooms on the sno C/D box RNA called 'PABsnRNA29'.
(ii) Now select the subsequence from 1403702 to 1403711 (a subpart of the snoRNA) that is known4 to interact with the 16S rRNA. That is, press the middle mouse button at the position 1403702 immediately above the axis, and drag the mouse to 1403711 surrounding the sequence 'TGTAGCGCGC' (Figure 2b). A window opens that shows the selected sequence. Close it.
(iii) Open the 'RNA' menu and click the 'Search RNA/RNA interaction' item. A window (Figure 2c) opens for configuring the interaction search.
(iv) In the top part of the window, type the genomic positions: 205039 to 206541 (the 16S rRNA positions). In the strand list, choose the strand +.
(v) In the bottom part, click the 'Search' button to launch the search of targets in the specified regions. After a few seconds, a new window (Figure 2d) opens that lists the predicted targets of the previously selected sequence. The first on the list is the good4 one.
(vi) Click the first interaction picture to visualize the interaction in a new window (Figure 2e). Click the 'Close' button and then click the 'Close' button on the previous window.

11| Visualizing graphs
(i) Go to the ApolloRNA page http://carlit.toulouse.inra.fr/ApolloRNA/apollornauserguide.html, click the link 'NC_000868_mfe150.txt' and save the file on your computer. This file contains the minimum free energy computed on a sliding window of 150 nucleotides.
(ii) Open the 'Analysis' menu and click the 'Show quantitative variable...' item. A new window (Figure 3a) opens.
(iii) In the first table of the window, in the first line, click the 'Load...' button. Select the file 'NC_000868_mfe150.txt' and click the 'Open' button. In the column 'Color', click the black rectangle, choose the red color and click the 'OK' button.
(iv) In the second table of the window, in the third line 'ComputeGC%': click the 'Show' checkbox, in the column 'Color', click the black rectangle and choose the blue color. Click the 'OK' button and then click the 'Close' button.
(v) In the yet opened 'Find' window, in the field 'Name', type now: 16S. Click the 'Find' button on the line. Click the 'X.1' zoom button to change scale. Move down the superior vertical scrollbar to see RNA annotations on the direct strand. The Apollo main window (Figure 3b) displays now two additional graphs that vary on the ribosomal RNAs region.

Troubleshooting

If troubleshooting was encountered in installing Apollo, notes of the page 'Download and Install Apollo' ( http://www.fruitfly.org/annot/apollo/install.html ) could help.
If the genome is long or information files voluminous, loading time could be long (depending on the characteristics of the computer).

Critical Steps

Steps that require administrator privileges on the computer are emphasized as critical.

Anticipated Results

In the protocol, the page http://carlit.toulouse.inra.fr/ApolloRNA/convert.html was mentioned to convert GenBank annotation data or TransTermHP predictions of transcription terminator in an Apollo suitable GAME XML format. This page could also be useful for converting data as CRISPRfinder5 predictions of CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats), MOSAIC6 comparative data of closely related bacterial genomes, PatScan7 pattern matching data, DARN8 non-coding RNA detection data. On request, convert capabilities could be extended.
Beside the individual interest of the environment, we found useful to share in a project a common representation of information. That is achieved by defining how information is displayed (color, shape, individual or shared track...). This could then be stored in a type preference file which could be duplicated. Another interesting feature is the ability for several persons to walk along the genome on an inflated video projection of the representation and discuss on particular regions. Finally the software could support interdisciplinary work allowing a common representation. This is put into practice on three genomes: Pyrococcus abyssi, Staphylococcus aureus and Ralstonia solanacearum. For S. aureus, the approach for RNA/RNA interaction prediction was successfully used to predict targets9 of the RNA III and some small ncRNAs candidates have yet been experimentally validated (to be published).

References

1. Lewis, S.E. et al. Apollo : a sequence annotation editor. Genome Biol. 3, R12, (2002).
2. Hofacker, I.L. Vienna RNA secondary structure server. Nucleic Acids Res. 31, 3429-3431, (2003).
3. Kingsford, C., Ayanbule, K., Salzberg, S.L. Rapid, accurate, computational discovery of Rho-independent transcription terminators illuminates their relationship to DNA uptake. Genome Biol. 8, R22, (2007).
4. Gaspin, C. et al. Archaeal homologs of eukaryotic methylation guide small nucleolar RNAs: lessons from the Pyrococcus genomes. J. Mol. Biol. 297, 895-906, (2000).
5. Grissa, I., Vergnaud, G., Pourcel C. CRISPRFinder: a web tool to identify clustered regularly interspaced short palindromic repeats. Nucleic Acids Res. published online on May 31, (2007).
6. Chiapello, H. et al. Systematic determination of the mosaic structure of bacterial genomes: species backbone versus strain-specific loops. BMC Bioinformatics 6, 171, (2005).
7. Dsouza, M., Larsen, N., Overbeek, R. Searching for patterns in genomic data. Trends Genet. 13, 497-498, (1997).
8. Zytnicki, M., Gaspin, C., Schiex, T. DARN! A soft constraint solver for RNA motif localization. To be published in Constraints, (2007).
9. Boisset, S. et al. Staphylococcus aureus RNAIII coordinately represses the synthesis of virulence factors and the transcription regulator Rot by an antisense mechanism. Genes Dev. 21, 1353-1366, (2007).

Acknowledgements

Keywords

ncRNA, visualization, annotation

ApolloRNA Figures

This word file contains larger versions of all the figures associated with this protocol.
ApolloRNA Figures.doc

Figure 1

Windows for configuring Windows operating system: (a) and (b) and displayed windows calling Apollo (c) and (d).


Figure 2

Displayed windows looking for a secondary structure: (a) and searching RNA/RNA interactions: (b), (c), (d) and (e).


Figure 3

Displayed windows visualizing quantitative variables: (a) and (b).


Comments

For step 5 Download and install ApolloRNA, go now to http://mulcyber.toulouse.inra.fr/gf/project/apollorna/frs/
to download apollo.jar.

We are really sorry for this address change due to the upgrade of our source forge.
Marie-Josee Cros

Please note that Apollo pages have moved.

In step 1 and section Troubleshooting, the download page is now http://apollo.berkeleybop.org/current/install.html

Marie-Josee Cros

Post a comment


Extra navigation

Search Protocols

Feedback

2 comments have been posted on this protocol

ADVERTISEMENT