Jump to content

Biologist/geneticist?


kevino

Recommended Posts

Is anyone on here by chance a biologist or geneticist of some nature? Im suppose to decipher 4000+ genes from chicken proteins and am having one helluva time doing it. My research professor is out of town and wants to see how far I've gotten when he gets back next week. Thanks.

Link to comment
Share on other sites

  • Replies 25
  • Created
  • Last Reply

Top Posters In This Topic

Top Posters In This Topic

...decipher 4000+ genes from chicken proteins ...

 

what do you mean?

 

This:

CGCGCCGCGCGGGCTGGAGGCGGCAGCGCCGCTGTGGAGAAGCCGCCGGGCGGAGGTTTGCTGTAAACTGCCTACAGTAGCTATAATGACACCAGCCCTC

MetThrProAlaLeu

 

AGAGAGGCAGCAACAAAAGGTCATGGTATCCACTTGTCACCTTCTTTGTCCTCTAGAGCTATGGAGTCTGATACAACTTTGTGCATGGAAAATTCCAGAGCA

ArgGluAlaAlaThrLysGlyHisGlyIleHisLeuSerProSerLeuSerSerArgAlaMetGluSerAspThrThrLeuCysMetGluAsnSerArgAla

 

GTGGAAGAGAAGATAAAAGAGGACTCCATTGCACGGATTACTTGCTCAGTCCTGGGGTTCCCCACTGCTGAGCCCAGCCTCAGGAATAATATCTTCAGCGTG

ValGluGluLysIleLysGluAspSerIleAlaArgIleThrCysSerValLeuGlyPheProThrAlaGluProSerLeuArgAsnAsnIlePheSerVal

 

CAGCATTTTGCTTCTCCACCATCCTCCAAGCACTATCAGTCTGTCTTGTTAATGAGTACCAACTCTGCGCTTAGCAGCAAAACCGGTAAGCAAATGAAATCG

GlnHisPheAlaSerProProSerSerLysHisTyrGlnSerValLeuLeuMetSerThrAsnSerAlaLeuSerSerLysThrGlyLysGlnMetLysSer

 

GGAGAGCCTGACTGCTCCAGGATGAGAAATGCATTACATAATGGCGCCGACGCATCGTTTGGTCAGATCAGTCATTCAGGACCTGAGGAACAGGTCAAAGGG

GlyGluProAspCysSerArgMetArgAsnAlaLeuHisAsnGlyAlaAspAlaSerPheGlyGlnIleSerHisSerGlyProGluGluGlnValLysGly

 

GAAGCTTTTTCAGAGACCACATCTCCGAATTTGGCAGAAACGCAAAGACTTTTGGATTCGAATGTAACTGAATCCGGCAACGCAGAAGAAGTGCAGCTCTTA

GluAlaPheSerGluThrThrSerProAsnLeuAlaGluThrGlnArgLeuLeuAspSerAsnValThrGluSerGlyAsnAlaGluGluValGlnLeuLeu

 

AATGATAAATGGTACAAGAAGAATGGTTTTTTGGGTAGGGCTCTGGGAGTCTGCGCCGAAACAATAAAAGGGGATTTATTACACCAAATTCTTCACGGGCCT

AsnAspLysTrpTyrLysLysAsnGlyPheLeuGlyArgAlaLeuGlyValCysAlaGluThrIleLysGlyAspLeuLeuHisGlnIleLeuHisGlyPro

 

TCAGAAGGGATTTTGAGCTGTGCCCGGGAGGAGGTGTACGCTCGCTTACTCCAGTGTGTCACTAAGCAACAAATGGAGATCAGCCGCGCCAAAAGAACTCAG

SerGluGlyIleLeuSerCysAlaArgGluGluValTyrAlaArgLeuLeuGlnCysValThrLysGlnGlnMetGluIleSerArgAlaLysArgThrGln

 

AAACGTTTACAAATGCTCCTGGCAAAGCATGTTATCAAACACTGTGATCAGCAGCTGAAGTGCTTCGTAAAACATCAGTTTCAAAGAATGAAGCTTTTACAC

LysArgLeuGlnMetLeuLeuAlaLysHisValIleLysHisCysAspGlnGlnLeuLysCysPheValLysHisGlnPheGlnArgMetLysLeuLeuHis

 

AAGCCAGCCAGGTTTTCGAGCAGTAGCTCCCTCAGGTGTGCGGACGGTTGGCCAGAAAACACTGCAGCTACTTTGGGAAGTAGTTCGAGTGCGGATGTACAG

LysProAlaArgPheSerSerSerSerSerLeuArgCysAlaAspGlyTrpProGluAsnThrAlaAlaThrLeuGlySerSerSerSerAlaAspValGln

 

AATGGAGTCTGTGTTGCACCGGGGGAGATCCGGGGGTTTGCCCTTTCTGCTGGAGGGCTGCTGTCTCGTGTGGAAAAGGATCTGGACTCCGATGCAACGTGT

AsnGlyValCysValAlaProGlyGluIleArgGlyPheAlaLeuSerAlaGlyGlyLeuLeuSerArgValGluLysAspLeuAspSerAspAlaThrCys

 

AGCAGCTCAGATGAAGACTATGATGAACAGACTGTAAGAACAGCTGTGGAAGCCAGCTATACTTCTGAATGGAAGTGGCTTGCAGACAGAGCTAGAATTGGC

SerSerSerAspGluAspTyrAspGluGlnThrValArgThrAlaValGluAlaSerTyrThrSerGluTrpLysTrpLeuAlaAspArgAlaArgIleGly

 

AGCCGTTGGACATGGCTTCAAGCCCAGATTTCAGAACTAGAATACAAAATCCAACAACTAACTGACCTTCACAGGCAGATACGTGCCACCAAGAGTGCACAA

SerArgTrpThrTrpLeuGlnAlaGlnIleSerGluLeuGluTyrLysIleGlnGlnLeuThrAspLeuHisArgGlnIleArgAlaThrLysSerAlaGln

 

CTGAGTGAAATCATCAGTAGCCTCATTGCTCCTCTCAACCTGTCTCCAGCTTCTGCTCTGTCATCCAAAACTTGCAGGCACAGACAGCTGGTCAATGGCATC

LeuSerGluIleIleSerSerLeuIleAlaProLeuAsnLeuSerProAlaSerAlaLeuSerSerLysThrCysArgHisArgGlnLeuValAsnGlyIle

 

TCCTTCAGGGCTTCAGACAACAGAGAAGTATCCTCTTCGAGCAGCTGGTTGCTTGATCATCAGCACATCAAGAAAAGAAGAAGAGACAGGACAAGACTAAGG

SerPheArgAlaSerAspAsnArgGluValSerSerSerSerSerTrpLeuLeuAspHisGlnHisIleLysLysArgArgArgAspArgThrArgLeuArg

 

TCTTTTTCCGTGACTAATGTCAGCACATCTGCCAGAACAAGACCACTTCACAGTTTCCAGAAGAGAAAACTCTACAGAATGCACGGTGCCTGTGACTGGAAT

SerPheSerValThrAsnValSerThrSerAlaArgThrArgProLeuHisSerPheGlnLysArgLysLeuTyrArgMetHisGlyAlaCysAspTrpAsn

 

CCGCAGACTTCATCATTTAGAGATACCTCCTGTCCGTACAGAACTCAGTTACCATGTGTGGTGCCACCCTCAGCATTGAGCAGTAGTGAGTACAGTCCGGAA

ProGlnThrSerSerPheArgAspThrSerCysProTyrArgThrGlnLeuProCysValValProProSerAlaLeuSerSerSerGluTyrSerProGlu

 

TCTAAAATGCTGGACTACGTGCAAGAGCTGGACTCTTCCTTCCATCCAGTCCTATCATTCCCTTCAGATATTCCTCTTCACATATACTTTGAAACATTATTA

SerLysMetLeuAspTyrValGlnGluLeuAspSerSerPheHisProValLeuSerPheProSerAspIleProLeuHisIleTyrPheGluThrLeuLeu

 

AGGAAGGATGACGTCAAGGGAGAACCTGTTGATGCCTCATATCCTGGAGTGGAGTTTAAAATAGCTCCAGAAAATGTGTCAGGAACATCTGAGCAGCTGTCG

ArgLysAspAspValLysGlyGluProValAspAlaSerTyrProGlyValGluPheLysIleAlaProGluAsnValSerGlyThrSerGluGlnLeuSer

 

GAAGGAAGAAAGAAAAGACATCTAAGTGAGACAGCAGTGGGCGAACGTAACACCAGGTTTGAGACGTTCTCCTTTCAACATGCAGAACCAGAATCCCCTAGC

GluGlyArgLysLysArgHisLeuSerGluThrAlaValGlyGluArgAsnThrArgPheGluThrPheSerPheGlnHisAlaGluProGluSerProSer

 

AGTTTTGCAGCCGTGACCAATATCAATGCGATGTCTAGGCCCACTCACAGCACTTCATCACAGCATAACTCCAGGAGGAGGCTGAGAAGTGAGAGCTCCTAT

SerPheAlaAlaValThrAsnIleAsnAlaMetSerArgProThrHisSerThrSerSerGlnHisAsnSerArgArgArgLeuArgSerGluSerSerTyr

 

GATATAGACAACATTGTTATTCCAATGTCACTGGTGGCACCGTCAAAGTTGGAGAAACTGCAGTACAAAGAAATCCTGACGCCAAGCTGGAGAGTTGTTGAA

AspIleAspAsnIleValIleProMetSerLeuValAlaProSerLysLeuGluLysLeuGlnTyrLysGluIleLeuThrProSerTrpArgValValGlu

 

TTAGAACCTTTGGAAAGATCTCATGCAGATGAAGAAGAGATCGTACGGCAAAAATGCTGATGGACGACACGGCCAAGATTCGGTGCAGAAGGACCACCCCGG

LeuGluProLeuGluArgSerHisAlaAspGluGluGluIleValArgGlnLysCysEnd

 

CAGCTCCTGTGCCTCGCTGCACTGCGCTGCTGAACCTGTTCCTGACCTGACTTCTGAAGCCCACAGCTCTGTTTGTTCAGGGATTGCACAACTCCGCAGGGA

GAGCCAGGAAGCAAAGTCTGGGCTGTGGGAACTTCGAGTTTTTCCACTAAAAGACGAAGAGGTGGAAGCTTTACTGTGCCAAGATCAAATAACAAATCAGAC

TGAAATGTCAAGTGCAGCTTTCCCCAGCGACTCTCCCTGTGCTCCATGTACGCCCATTGGTTTGCCAGACAATGGCCTTCCAGCACGAAAACAATCCACAGA

AGAGTCAGAAGAGAGCGAAAATGTTTGCCTGGGAATCAACAGTACAAGAAAGCAAAGGTGACAGGGAGTAATGTGGTTCATTAAGAAAGCCAATTAAGGTGG

AGAAAAGTGTAAGGTAAAATCTCTCTTGTTCACACAGCATATGACAGCACAATTCAAAACCTGTACGTTAGCTGCATATAATCTATTTTCTTTCTTGCTTTG

TAACAGGCAGGATAAATCCCAAAGACAACACTTTCTTCTTCCTGAACCCCAAGAGAAAGTAATTATTTCAAGCTGGAAGTCTAATATTCCCAGTGTTGGGAG

TGGGCACACATAAATTGCTGGCAGCATTCGCTGCTTTCTGTTGTCGTATCGCCACATTTGTTAGTGATCATCCTTTTTTGTCTTGTTTGTTTTCAAACGCGT

TTCTGGTTCCAGTCAATTCTGGTGCGGCAGGATATGCTGTCTACCTTAAACCAACAGCTTCAACTGAACACAGAAAAACTACTTGACAATATAGTGGGGAGG

AACGATAACCACTCATTATTTTCCTGCGTGGCGTGAACATGTGGCAGAAAATCCCCCTCTCCTCAAATGCACGTGTGTAACCTGTGCGTAAGAGAAGGGCAG

GTGAGAAGCAGCAGTTTGAAAGCAGGCGGTGTGGAAGCCTTGCCTTTAGCTTAAAATCTGTGCTCAGCTCTCAGTAGTGCCAGGGCGTGTTTTGTATAGACG

TCCTCAAGCAATAAGATAGAAATAGACCACTGCATTTCAATAGTCCACGTATCTGCTACTCAGAAAGTAGGGTTCCTGTTTGTCATATGGGTGTAGAGAAGT

GGAAGAGAAGAGGAGCTCTCTGAATATTGCATTCTGGGTCATCTCCCTTCTTTAGTCCTTCAGTAGCAAAGGAGGCTCGTAATATTTTGCTTTCATATGTTA

AAGACAAAAAGTCTTGTAATTTCTGTACAGTGCCTTGCAAGAGTCTCTAAGTATGCCAACATTTGGTGACTGGCTGAAAAGGAAGAGATGGCTGATGGCTAG

AAGACCAGTAAGGAAAAAAGCCCAAAAGCTCTCCGCTGTGATGCTGGGTGGAGTTCTAATTTCCCAGCGAGTTCACAGTGAGTAAAGAAAAGCAAGTTGTTT

CAACTTGCATCTAAATAGGATAGTTTGGAATGTATCTGCAATCTCAAGAGGTATTAACAAAGATTAAGCATTGAATGTAGGTATAATTTACCGAAGGGAAGT

TCTTTCTGTGTAGGACAGTTGAAGGCAATATTATAGGAGTCTCTGTTTTTGTGCACAAGTTTCTTCTGTGTGTAATATCTGGAACTGAACGGCACTGGACAG

ATGACCCAGTTATAGTGTCTTCTTTTTTTTTTTGTCAGAATGACAGGAAATATACACTGGACATCTGAAAACAGTTCAGTAAAGTTTCAACACTAAAAAAAA

 

Link to comment
Share on other sites

LOCUS XM_421860 2854 bp mRNA linear VRT 16-NOV-2006

DEFINITION PREDICTED: Gallus gallus similar to hypothetical protein FLJ23861

(LOC424004), mRNA.

ACCESSION XM_421860

VERSION XM_421860.2 GI:118093283

KEYWORDS .

SOURCE Gallus gallus (red jungle fowl)

ORGANISM Gallus gallus

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

Archosauria; Dinosauria; Saurischia; Theropoda; Coelurosauria;

Aves; Neognathae; Galliformes; Phasianidae; Phasianinae; Gallus.

COMMENT MODEL REFSEQ: This record is predicted by automated computational

analysis. This record is derived from an annotated genomic sequence

(NW_001471728) using gene prediction method: GNOMON, supported by

mRNA and EST evidence.

Also see:

Documentation of NCBI's Annotation Process

 

On Nov 16, 2006 this sequence version replaced gi:50750076.

FEATURES Location/Qualifiers

source 1..2854

/organism="Gallus gallus"

/mol_type="mRNA"

/strain="inbred line UCD001"

/isolate="#256"

/db_xref="taxon:9031"

/chromosome="7"

/sex="female"

/common="red jungle fowl"

/note="inbred line derived from a wild population of red

jungle fowl in Malaysia in the late 1930s, with the

possible introgression of a limited amount of White

Leghorn genome during its captive breeding history"

gene 1..2854

/gene="LOC424004"

/note="Derived by automated computational analysis using

gene prediction method: GNOMON. Supporting evidence

includes similarity to: 2 mRNAs, 5 ESTs, 1 Protein"

/db_xref="GeneID:424004"

CDS 107..2854

/gene="LOC424004"

/codon_start=1

/product="hypothetical protein"

/protein_id="XP_421860.2"

/db_xref="GI:118093284"

/db_xref="GeneID:424004"

/translation="MESDTTLCMENSRAVEEKIKEDSIARITCSVLGFPTAEPSLRNN

IFSVQHFASPPSSKHYQSVLLMSTNSALSSKTGKQMKSGEPDCSRMRNALHNGADASF

GQISHSGPEEQVKGEAFSETTSPNLAETQRLLDSNVTESGNAEEVRLLNDKWYKKNGF

LGRALGVCAETIKGDLLHQILHGPSEGILSCAREEVYARLLQCVTKQQMEISRAKRTQ

KRLQMLLAKHVIKHCDQQLKCFVKHQFQRMKLLHKPARFSSSSSLRCADGWPENTAAT

LGSSSSADVQNGVCVAPGEIRGFALSAGGLLSRVEKDLDSDATCSSSDEDYDEQTVRT

AVEASYTSEWKWLADRARIGSRWTWLQAQISELEYKIQQLTDLHRQIRATKGMVILEE

FPFPKDILKKQIQLTDQEALLNATGNSQAAIERQDSLPEHDFEMSPSSPTLLLRNIEK

QSAQLSEIISSLIAPLNLSPASALSSKTCRHRQLVNGISFRASDNREVSSSSSWLLDH

QHIKKRRRDRTRLRSFSVTNVSTSARTRPLHSFQKRKLYRMHGACDWNPQTSSFRDTS

CPYRTQLPCVVPPSALSSSEYSPESKMLDYVQELDSSFHPVLSFPSDIPLHIYFETLL

RKDDVKGEPVDASYPGVEFKIAPENDYNHHNVALKRWNNSCLSNSKSQSVSGTSEQLS

EGRKKRHLSETAVGERNTRFETFSFQHAEPESPSSFAAVTNINAMSRPTHSTSSQHNS

RRRLRSESSYDIDNIVIPMSLVAPSKLEKLQYKEILTPSWRVVELEPLERSHADEEEA

EDLSDEVFSSRHTKYEERERARWSLWEQSRWPRRNSRSYGKNADGRHGQDSVQKDHPG

SSCASLHCAAEPVPDLTSEAHSSVCSGIAQLRRESQEAKVGGSGSFQSVQDLSDLRFF

S"

ORIGIN

1 ggtttgctgt aaactgccta cagtagctat aatgacacca gccctcagag aggcagcaac

61 aaaaggtcat ggtatccact tgtcaccttc tttgtcctct agagctatgg agtctgatac

121 aactttgtgc atggaaaatt ccagagcagt ggaagagaag ataaaagagg actccattgc

181 acggattact tgctcagtcc tggggttccc cactgctgag cccagcctca ggaataatat

241 cttcagcgtg cagcattttg cttctccacc atcctccaag cactatcagt ctgtcttgtt

301 aatgagtacc aactctgcgc ttagcagcaa aaccggtaag caaatgaaat cgggagagcc

361 tgactgctcc aggatgagaa atgcattaca taatggcgcc gacgcatcgt ttggtcagat

421 cagtcattca ggacctgagg aacaggtcaa aggggaagct ttttcagaga ccacatctcc

481 gaatttggca gaaacgcaaa gacttttgga ttcgaatgta actgaatccg gcaacgcaga

541 agaagtgcgg ctcttaaatg ataaatggta caagaagaat ggttttttgg gtagggctct

601 gggagtctgc gccgaaacaa taaaagggga tttattacac caaattcttc acgggccttc

661 agaagggatt ttgagctgtg cccgggagga ggtgtacgct cgcttactcc agtgtgtcac

721 taagcaacaa atggagatca gccgcgccaa aagaactcag aaacgtttac aaatgctcct

781 ggcaaagcat gttatcaaac actgtgatca gcagctgaag tgcttcgtaa aacatcagtt

841 tcaaagaatg aagcttttac acaagccagc caggttttcg agcagtagct ccctcaggtg

901 tgcggacggt tggccagaaa acactgcagc tactttggga agtagttcga gtgcggatgt

961 acagaatgga gtctgtgttg caccggggga gatccggggg tttgcccttt ctgctggagg

1021 gctgctgtct cgtgtggaaa aggatctgga ctccgatgca acgtgtagca gctcagatga

1081 agactatgat gaacagactg taagaacagc tgtggaagcc agctatactt ctgaatggaa

1141 gtggcttgca gacagagcta gaattggcag ccgttggaca tggcttcaag cccagatttc

1201 agaactagaa tacaaaatcc aacaactaac tgaccttcac aggcagatac gtgccaccaa

1261 ggggatggtg atcttagaag aattcccatt tccaaaagac attttgaaga agcaaataca

1321 gttgacagac caagaagctt tattaaacgc cacagggaat tcgcaagctg ccattgagag

1381 acaggattct ttgccggagc atgactttga aatgtcaccc agcagtccta ccctgctttt

1441 acgaaacata gaaaaacaga gtgcacaact gagtgaaatc atcagtagcc tcattgctcc

1501 tctcaacctg tctccagctt ctgctctgtc atccaaaact tgcaggcaca gacagctggt

1561 caatggcatc tccttcaggg cttcagacaa cagagaagta tcctcttcga gcagctggtt

1621 gcttgatcat cagcacatca agaaaagaag aagagacagg acaagactaa ggtctttttc

1681 cgtgactaat gtcagcacat ctgccagaac aaggccactt cacagtttcc agaagagaaa

1741 actctacaga atgcacggtg cctgtgactg gaatccgcag acttcatcat ttagagatac

1801 ctcctgtccg tacagaactc agttaccatg tgtggtgcca ccctcagcat tgagcagtag

1861 tgagtacagt ccggaatcta aaatgctgga ctacgtgcaa gagctggact cttccttcca

1921 tccagtccta tcattccctt cagatattcc tcttcacata tactttgaaa cattattaag

1981 gaaggatgac gtcaagggag aacctgttga tgcctcatat cctggagtgg agtttaaaat

2041 agctccagaa aatgactata atcaccataa tgttgctctt aaacgatgga acaatagctg

2101 tttatctaat tccaaatctc aatcagtgtc aggaacatct gagcagctgt cggaaggaag

2161 aaagaaaaga catctaagtg agacagcagt gggcgaacgt aacaccaggt ttgagacgtt

2221 ctcctttcaa catgcagaac cagaatcccc tagcagtttt gcagccgtga ccaatatcaa

2281 tgcgatgtct aggcccactc acagcacttc atcacagcat aactccagga ggaggctgag

2341 aagtgagagc tcctatgata tagacaacat tgttattcca atgtcactgg tggcaccgtc

2401 aaagttggag aaactgcagt acaaagaaat cctgacgcca agctggagag ttgttgaatt

2461 agaacctttg gaaagatctc atgcagatga agaagaggca gaagatctgt cagatgaagt

2521 gttttcctca cgccatacca agtatgaaga aagagagcga gcgaggtggt cgctgtggga

2581 gcagagccgc tggcccagaa ggaacagcag atcgtacggc aaaaatgctg atggacgaca

2641 cggccaagat tcggtgcaga aggaccaccc cggcagctcc tgtgcctcgc tgcactgcgc

2701 tgctgaacct gttcccgacc tgacttctga agcccacagc tctgtttgtt cagggattgc

2761 acaactccgc agggagagcc aggaagcaaa ggtgggtggt tctggtagct tccagtccgt

2821 tcaggacttg agtgacctcc ggttctttag ctaa

//

 

 

Link to comment
Share on other sites

if I could be an enzyme, I would be DNA Helicase so I could unzip your genes

 

are you saying you have to give a DNA sequence, just by translating from the protein sequence? it's not so simple since there are many DNA sequences that all code for the same amino acid

 

 

Jesus, I love nerds and all their nerdy-ass silly jokes!

Link to comment
Share on other sites

So you have a bunch of DNA sequence tags from an affymetrix chip and you need to know what the proteins sequences for each are?

 

or

 

You have some protein sequences you got from _________ and you need the complete gene sequences for each? Or maybe you just need the gene names?

 

Am I getting close?

 

I still don't understand what it is you are trying to do.

Link to comment
Share on other sites

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.

Guest
Reply to this topic...

×   Pasted as rich text.   Paste as plain text instead

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.




×
×
  • Create New...