FASTA searches a protein or DNA sequence data bank
 version 3.2t09 December 7, 1999
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

/net/nfs0/vol1/production/w3nobody/tmp/262525.2545: 24 nt
 >Sequence
 vs EMBL Human library
searching /ebi/services/idata/fastadb/em_hum library

       opt    E()
< 20    89      0:=
  22    28      0:=          one = represents 170 library sequences
  24   119      0:=
  26   208      2:*=
  28   366     27:*==
  30   677    162:*===
  32  1459    627:===*=====
  34  2332   1702:==========*===
  36  3521   3495:====================*
  38  5117   5775:=============================== *
  40  7428   8056:============================================ *
  42  8862   9848:===================================================== *
  44  9256  10863:======================================================= *
  46 10186  11064:===========================================================*
  48  9983  10592:===========================================================*
  50  9617   9666:========================================================*
  52  8705   8498:=================================================*==
  54  7292   7259:==========================================*
  56  7093   6063:===================================*======
  58  5492   4978:=============================*===
  60  4504   4032:=======================*===
  62  3724   3233:===================*==
  64  2636   2571:===============*
  66  2138   2032:===========*=
  68  1816   1598:=========*=
  70  1264   1253:=======*
  72   952    979:=====*
  74   708    763:====*
  76   516    594:===*
  78   421    462:==*
  80   323    358:==*
  82   223    274:=*
  84   200    217:=*
  86   135    168:*
  88   116    130:*          inset = represents 2 library sequences
  90    98    101:*
  92    53     78:*          :=========================== *
  94    34     60:*          :================= *
  96    33     47:*          :================= *
  98    28     36:*          :============== *
 100    14     28:*          :======= *
 102    14     22:*          :======= *
 104     6     17:*          :=== *
 106     5     13:*          :=== *
 108     5     10:*          :=== *
 110     7      8:*          :===*
 112     3      6:*          :==*
 114     3      5:*          :==*
 116     2      4:*          :=*
 118     2      3:*          :=*
>120     3      2:*          :*=

824675648 residues in 113725 sequences
 statistics extrapolated from 50000 to 117785 sequences
 Expectation_n fit: rho(ln(x))= 4.4212+/-0.000225; mu= 11.0369+/- 0.017;
 mean_var=31.7879+/- 5.255, 0's: 1 Z-trim: 26 B-trim: 1833 in 3/84
 Kolmogorov-Smirnov statistic: 0.0231 (N=29) at 50

FASTA (3.28 September 1999) function [optimized, +5/-4 matrix (5:-4)] ktup: 1
 join: 70, opt: 55, gap-pen: -16/ -4, width: 16
 Scan time: 832.400

The best scores are:                                 initn init1  opt  z-sc E(117785)
EM_HUM:HSB1 L22754 HUMAN BETA-GLOBIN      (2999) [f]   120   120  120 180.5    0.0061
EM_HUM:HSHBB U01317 HUMAN BETA GLOBIN    (73308) [f]   120   120  120 155.4    0.0063

>>EM_HUM:HSB1 L22754 HUMAN BETA-GLOBIN CLUSTER GENE, ENH (2999 nt)
 initn: 120 init1: 120 opt: 120 Z-score: 180.5 expect() 0.0061
100.000% identity in 24 nt overlap (1-24:2688-2711)

                                             10        20
Sequen                               GAATTCTAATCTCCCTCTCAACCC
                                     ::::::::::::::::::::::::
EM_HUM TGATTTAAGCCTTTTTGGTCATAAAACATTGAATTCTAATCTCCCTCTCAACCCTACAGT
      2660      2670      2680      2690      2700      2710

EM_HUM CACCCATTTGGTATATTAAAGATGTGTTGTCTACTGTCTAGTATCCCTCAAGCAGTGTCA
      2720      2730      2740      2750      2760      2770

>>EM_HUM:HSHBB U01317 HUMAN BETA GLOBIN REGION ON CHROMO (73308 nt)
 initn: 120 init1: 120 opt: 120 Z-score: 155.4 expect() 0.0063
100.000% identity in 24 nt overlap (1-24:1-24)

               10        20
Sequen GAATTCTAATCTCCCTCTCAACCC
       ::::::::::::::::::::::::
EM_HUM GAATTCTAATCTCCCTCTCAACCCTACAGTCACCCATTTGGTATATTAAAGATGTGTTGT
               10        20        30        40        50        60

24 residues in 1 query sequences
824675648 residues in 113725 library sequences
 Tcomplib (4 proc)[version 3.2t09 December 7, 1999]
 start: Sun Aug 6 19:29:32 2000 done: Sun Aug 6 19:33:31 2000
 Scan time: 832.400 Display time: 0.080

Function used was FASTA