TOIG

[ Program Manual | User's Guide | Data Files | Databases ]

Table of Contents
FUNCTION
DESCRIPTION
EXAMPLE
OUTPUT
INPUT FILES
RELATED PROGRAMS
COMMAND-LINE SUMMARY
LOCAL DATA FILES
OPTIONAL PARAMETERS

FUNCTION

[ Top | Next ]

ToIG converts GCG sequence file(s) into a single file in IntelliGenetics format.

DESCRIPTION

[ Previous | Top | Next ]

Any sequence file in GCG format can be converted with ToIG into a format suitable for use in the IG programs. ToIG accepts one or more GCG sequence files as input and creates one output file, containing all the sequences converted to IG format.

EXAMPLE

[ Previous | Top | Next ]

In this session with ToIG, all of the individual files generated in the sample run of FromIG are put back into IG format.


% toig

  TOIG of what GCG sequence(s) ?  sur*

  surphist1 788 bp
  surphist2 188 bp
  surphist3 159 bp

  ////////////////

  surshist2 682 bp

  What should I call the output file (* surphist1.ig *) ?  test.ig

%

OUTPUT

[ Previous | Top | Next ]

Here is part of the output file:


;  TOIG of: surphist1  check: 6642  from: 1  to: 788
;
;
;  FROMIG of: urchin.nih
;
;  definition  sea urchin(p.mil.) histone genes; h4 gene. 788bp
;  locus       surphist1       788 bp                  updated   11/01/82

/////////////////////////////////////////////////////////////////////////

;  (circular sequence)
; surphist1  Length: 788  October 16, 1996 15:09  Type: N  Check: 6642  ..
surphist1
CAACATATTAGAGGAAGGGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGGGGGGGGGGGAGGGAGAAT
TGCCCAAAACACTGTAAATGTAGCGTTAATGAACTTTTCATCTCATCGACTGCGCGTGTATAAGGATGAT

//////////////////////////////////////////////////////////////////////

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGCCGAACACTGTACGGCTTCGGCGGCTAAGTGAAGCA
GACTTGGCTAGAATAACG2

//////////////////////////////////////////////////////////////////////

;  TOIG of: surshist2  check: 2233  from: 1  to: 682
;
;
;  FROMIG of: urchin.nih
;
;  definition  sea urchin(s.purpuratus) histone genes; h2a. 682bp
;  locus       surshist2       682 bp                  updated   11/01/82

/////////////////////////////////////////////////////////////////////////

;  (linear sequence)
; surshist2  Length: 682  October 16, 1996 15:09  Type: N  Check: 2233  ..
surshist2
TCCATTCAAGTCATCGAACATTGTTACGTTCTGAACTTCGTCTTCCGATTTATTCTAAACTCATCAACAA
CATCATGTCTGGCAGAGGAAAGAGTGGAAAGGCCCGCACCAAGGCAAAGACGCGCTCATCCCGTGCAGGG

//////////////////////////////////////////////////////////////////////

TCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCCCTCTCTTAGTTTAAAACGCTACACTTGGG
ATGAACTAAGCCTTAACAGTTGTTGTATATAATGATTGATATATATTAATAA1

INPUT FILES

[ Previous | Top | Next ]

ToIG accepts multiple (one or more) nucleotide or protein sequences as input. You can specify multiple sequences in a number of ways: by using a list file, for example @project.list; by using an MSF or RSF file, for example project.msf{*}; or by using a sequence specification with an asterisk (*) wildcard, for example GenEMBL:*. The input files in this example are the output files from the example for FromIG. To create the input files for this example, run FromIG on urchin.nih.

RELATED PROGRAMS

[ Previous | Top | Next ]

The following programs convert sequences between other formats and GCG format: FromEMBL, FromGenBank, FromIG, FromPIR, FromStaden, FromFastA, ToIG, ToPIR, ToStaden and ToFastA.

DataSet creates a GCG data library from any set of sequences in GCG format. GCGToBLAST creates a database that can be searched by the BLAST program from any set of sequences in GCG format.

COMMAND-LINE SUMMARY

[ Previous | Top | Next ]

All parameters for this program may be put on the command line. Use the parameter -CHEck to see the summary below and to have a chance to add things to the command line before the program executes. In the summary below, the capitalized letters in the parameter names are the letters that you must type in order to use the parameter. Square brackets ([ and ]) enclose parameter values that are optional. For more information, see "Using Program Parameters" in Chapter 3, Using Programs in the User's Guide.


Minimal Syntax: % toig [-INfile=]Sur* -Default

Prompted Parameters: (for single sequences)

-BEGin=1 -END=444       range of sequence to convert
-REVerse                uses the reverse strand (nucleic acid sequences)

Other Prompted Parameters:

[-OUTfile=]seqname.ig   names the output file

Local Data Files:  None

Optional Switches:

-STAden                 converts Staden format file(s) into IG format

LOCAL DATA FILES

[ Previous | Top | Next ]

None.

OPTIONAL PARAMETERS

[ Previous | Top | Next ]

The parameters listed below can be set from the command line. For more information, see "Using Program Parameters" in Chapter 3, Using Programs in the User's Guide.

-STAden

allows input files to be in Staden format instead of GCG format.

Printed: November 18, 1996 13:07 (1162)

[ Program Manual | User's Guide | Data Files | Databases ]


Documentation Comments: doc-comments@gcg.com
Technical Support: help@gcg.com

Copyright (c) 1982, 1983, 1985, 1986, 1987, 1989, 1991, 1994, 1995, 1996, 1997 Genetics Computer Group, Inc. a wholly owned subsidiary of Oxford Molecular Group, Inc. All rights reserved.

Licenses and Trademarks Wisconsin Package is a trademark of Genetics Computer Group, Inc. GCG and the GCG logo are registered trademarks of Genetics Computer Group, Inc.

All other product names mentioned in this documentation may be trademarks, and if so, are trademarks or registered trademarks of their respective holders and are used in this documentation for identification purposes only.

Genetics Computer Group

www.gcg.com