EBI Banner

Results of Search:

Program: fasta3_t
Database: EHUM
Title: Sequence
SeqLen: 24


 FASTA searches a protein or DNA sequence data bank
 version 3.2t09 December 7, 1999
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

/net/nfs0/vol1/production/w3nobody/tmp/262525.2545: 24 nt
 >Sequence
 vs  EMBL Human library
searching /ebi/services/idata/fastadb/em_hum library

       opt      E()
< 20    89     0:=
  22    28     0:=          one = represents 170 library sequences
  24   119     0:=
  26   208     2:*=
  28   366    27:*==
  30   677   162:*===
  32  1459   627:===*=====
  34  2332  1702:==========*===
  36  3521  3495:====================*
  38  5117  5775:===============================  *
  40  7428  8056:============================================   *
  42  8862  9848:=====================================================    *
  44  9256 10863:=======================================================    *
  46 10186 11064:===========================================================*
  48  9983 10592:===========================================================*
  50  9617  9666:========================================================*
  52  8705  8498:=================================================*==
  54  7292  7259:==========================================*
  56  7093  6063:===================================*======
  58  5492  4978:=============================*===
  60  4504  4032:=======================*===
  62  3724  3233:===================*==
  64  2636  2571:===============*
  66  2138  2032:===========*=
  68  1816  1598:=========*=
  70  1264  1253:=======*
  72   952   979:=====*
  74   708   763:====*
  76   516   594:===*
  78   421   462:==*
  80   323   358:==*
  82   223   274:=*
  84   200   217:=*
  86   135   168:*
  88   116   130:*          inset = represents 2 library sequences
  90    98   101:*
  92    53    78:*         :===========================           *
  94    34    60:*         :=================            *
  96    33    47:*         :=================      *
  98    28    36:*         :==============   *
 100    14    28:*         :=======      *
 102    14    22:*         :=======   *
 104     6    17:*         :===     *
 106     5    13:*         :===   *
 108     5    10:*         :=== *
 110     7     8:*         :===*
 112     3     6:*         :==*
 114     3     5:*         :==*
 116     2     4:*         :=*
 118     2     3:*         :=*
>120     3     2:*         :*=
824675648 residues in 113725 sequences
 statistics extrapolated from 50000 to 117785 sequences
  Expectation_n fit: rho(ln(x))= 4.4212+/-0.000225; mu= 11.0369+/- 0.017;
 mean_var=31.7879+/- 5.255, 0's: 1 Z-trim: 26  B-trim: 1833 in 3/84
 Kolmogorov-Smirnov  statistic: 0.0231 (N=29) at  50

FASTA (3.28 September 1999) function [optimized, +5/-4 matrix (5:-4)] ktup: 1
 join: 70, opt: 55, gap-pen: -16/ -4, width:  16
 Scan time: 832.400
The best scores are:                             initn init1 opt z-sc E(117785)
EM_HUM:HSB1 L22754 HUMAN BETA-GLOBIN  (2999) [f]  120  120  120  180.5  0.0061
EM_HUM:HSHBB U01317 HUMAN BETA GLOBIN (73308) [f]  120  120  120  155.4  0.0063

>>EM_HUM:HSB1 L22754 HUMAN BETA-GLOBIN CLUSTER GENE, ENH  (2999 nt)
 initn: 120 init1: 120 opt: 120 Z-score: 180.5 expect() 0.0061
 100.000% identity in 24 nt overlap (1-24:2688-2711)

                                             10        20          
Sequen                               GAATTCTAATCTCCCTCTCAACCC      
                                     ::::::::::::::::::::::::      
EM_HUM TGATTTAAGCCTTTTTGGTCATAAAACATTGAATTCTAATCTCCCTCTCAACCCTACAGT
      2660      2670      2680      2690      2700      2710       

EM_HUM CACCCATTTGGTATATTAAAGATGTGTTGTCTACTGTCTAGTATCCCTCAAGCAGTGTCA
      2720      2730      2740      2750      2760      2770       

>>EM_HUM:HSHBB U01317 HUMAN BETA GLOBIN REGION ON CHROMO  (73308 nt)
 initn: 120 init1: 120 opt: 120 Z-score: 155.4 expect() 0.0063
 100.000% identity in 24 nt overlap (1-24:1-24)

               10        20                                        
Sequen GAATTCTAATCTCCCTCTCAACCC                                    
       ::::::::::::::::::::::::                                    
EM_HUM GAATTCTAATCTCCCTCTCAACCCTACAGTCACCCATTTGGTATATTAAAGATGTGTTGT
               10        20        30        40        50        60



24 residues in 1 query   sequences
824675648 residues in 113725 library sequences
 Tcomplib (4 proc)[version 3.2t09 December 7, 1999]
 start: Sun Aug  6 19:29:32 2000 done: Sun Aug  6 19:33:31 2000
 Scan time: 832.400 Display time:  0.080

Function used was FASTA