


Some features of repvis can optionally be influenced via configuration files stored in the directory .reputer in the user's home directory.
#genome name translation table
Full_Organism_Name=filename_prefix
For the example below, the following lines in the genome_names file produce the translated list:
#genome name translation table
Mycobacterium tuberculosis H37Rv=mtub
Neisseria meningitidis=nmen
Rhizobium sp. NGR234=pNGR234
Pyrococcus abyssi=pabyssi
Pyrococcus horikoshii=pyro
Rattus norvegicus mRNA=rn_mrna
Rickettsia prowazekii=rpxx
Saccharomyces cerevisiae=s_cerevisiae
The keyword REPSEQ specifies the position where the repeats sequence selected in the Inspector window is inserted. The keyword must be written at the start of a single line.
<html>
<body>
You have selected the following sequence:<p>
REPSEQ
<body>
<html>
In this example, REPSEQ will automatically be replaced by the repeats sequence. The resulting html code is then transferred to the Netscape browser, e.g.:
<html>
<body>
You have selected the following sequence:<p>
>ID=17958, Kind=F, Length=71, Pos1=384257, Pos2=1316890, Spacer=932562, Exact, E-Value=1.57e-31
gctatgaagtttgcctggctttcagtggcaactgccttagctagcaaggtcttaccagttcccggtgggcc
<body>
<html>

To list all available command line options call repvis with -help:
> repvis -help
*** repvis - Repeats Visualizer v0.3
*** Compiled by chris@kakerlake, Sun Apr 23 13:19:13 CEST 2000
*** gcc version 2.95 19990728 (release)
*** (c) 1998-2000 Stefan Kurtz and Chris Schleiermacher
*** Contact: reputer@genomes.de
*** For updates visit: http://www.genomes.de
Name
repvis
Synopsis
repvis [Options] directory
Description
This program visualizes 4 kinds of repeats as square, circle or
dot plot graph.
Options
-seqpath Override path for input sequence stored in .bin file.
-help Print help message.
-noseq Do not read in source sequence.
-png Generate a png-image in batch mode.
-pngprefix Prefix for png image.
-l size In batch mode: display repeats of this size and above.
-f In batch mode: display forward repeats
-r In batch mode: display reverse repeats
-c In batch mode: display complemented repeats
-p In batch mode: display reverse complemented repeats
-width In batch mode: png x size in pixels
-height In batch mode: png y size in pixels
directory Directory containing repfind binary output files
with .bin extensions.
In batch mode this must be a single repfind file.
The following options can only be used in connection with the -png option.

repvis expects repfind/repselect result data files to have the file extension '.bin'. By specifying a directory at startup, this location is scanned for .bin files.
If repfind encounteres a result data files, which was either not generated in your filesystem or the original DNA sequence was (re)moved, the program asks you to browse for the missing file.
Selecting No disables certain sequence related features like BLAST database queries. The same is archieved by the command line option -noseq. The advantage of ommitting the DNA sequence is an increased program performance.
Eventually, the repvis user interface is launched. The screenshot
below is a clickable map: select a region to get more information.
Repeats Graph
Clicking the graph window evokes the Back...
Color Key
Genome Selector
For better readability, a translation list may be used to display full organism names in addition to the filenames (see section Configuring).
Graphics Type
According to the type of graph and the kind of repeats selected by the user,
this panel displays either forward, reverse complemented (palindromic), complemented or reversed repeats as square graph, circle graph or dot plot.
The color key associates a color to a certain range of repeats sizes. In the
graph given above, repeats of sizes 4877 to 5502 are displayed as yellow lines, for
example.
The length of the shortest and longest repeat are marked in red color. In our example this is 2378 and 8626.
Back...
Clicking this button evokes a browser containing all files with the extension .bin in the directory specified when calling repvis.
Three different types of graphical representation are available:
| Square Graph | |
![]() |
|
|
Circle Graph |
Dot Plot |
|
|
Color Scheme Selector
Repeats Data Investigator
Depending on the current setting of the selection slider and the limit choice box,
this window displays data for all repeats as extracted with repselect,
fulfilling the selection slider setting:
The "Show Sequence" check button adds the DNA sequence to the list, "Show mismatch as IUB Code" switches from the IUB representation of incompletely specified residues to a list of all possible bases.
Repeats Type Selector
The button label contains the number of repeats extracted from the binary input file for each kind.
Our example above lists 500 forward repeats. Since there are no P, C or R repeats, the respective
buttons are disabled.
Selection Criterium Selector
Selection Value Slider
Program Status
After the Inspector window is launched, the user can zoom in to, or out from a
certain region for closer examination of repeats structures. This is achieved
by left or right clicking the repeats graph.
Note that the lines connecting the starting positions of a repeat are not drawn under the following condition:
Instead a short vertical marker represents the connecting line.
Clicking a repeat region in the upper sequence symbol directly, the repeat
data associated with this position are displayed in the Data Browser. Finally
the repeat sequence can be displayed with "View Sequence" or submitted to a
database query with "Transfer to Netscape".
In the Inspector window, the "Save Subsequence" button allows to save a part of the current source sequence.
An additional file can be created by hand or shell scipts to annotate the repeats visualization.
The annotation file must be a text file of the following format:
Text_Symbol Position_1 Position_2 Color_Code #Text_Comment
E.g.:
The Text_Symbol is translated into a graphical symbol.
Position_1 and Position_2 denote the starting and ending position of the annotation symbol. Note, that some symbols principially require only one position, as the vertical line. Nevertheless a second position must be supplied which can have an arbitrary value. The Color_Code can either be a hexadecimal value starting with a hash character #RRGGBB, consisting of three two digit hexadecimal values for red, green and blue. For example, #FF0000 means red, #00FF00 means green. Alternatively, Color_Code can be a colorname as specified here.
Two color schemes are available. The first creates a more 'esthetic' image of the repeats
while the second emphasizes the longest, i.e. most important repeats.
These four buttons specify the kind of repeat to display:
Back...
This choice box determines the repeats selection criterium.
Back...
The Selection Value slider allows the selection of repeats according to the
setting of the Seleciton Criterium Selector.
The boundaries of the slider are either the shortest/longest repeat from the current repeat
kind or the smallest/largest E-Value.
Back...
This line reports various program status messages. Clicking on the status line brings up
a status history.
Back...

The Inspector Window

This feature uses the remote control facility of the Netscape Navigator browser as described
in [3].


Exporting a Subsequence

The starting and ending position for the subsequence can either be typed in directly or be specified from the Inspector window by clicking the upper sequence symbol.
The subsequence can be saved in FASTA or flat format. The flat format contains no header line and no line breaks.
This switch includes the file extension of the source sequence in the new filename.
The directory location where the subsequence is saved to. Type in directly or use the ".." button to browse for a different path.
This automatically assembled filename for the subsequence consists of some or all of the following components: the basename of the source sequence, its original extension, starting and ending position and a .fna or .seq extension.

Annotating the Visualization
= 125 517 #000000 #1.13 Term
<= 682 966 #0000FF #1.12 Intr
<= 1494 2479 #0000FF #1.11 Intr
<= 2644 2702 #0000FF #1.10 Intr
<= 2772 2935 #0000FF #1.09 Intr
<= 3393 4154 #0000FF #1.08 Intr
<= 4327 4431 #0000FF #1.07 Intr
|
Available Text Symbols