Skip to main content

Table 3 Gene prediction from the putative new transcripts. The table shows the chromosome, position, number of exons, length and sequence of the putative new transcripts. In addition, InterPro prediction and observation from BLAST against NCBI nr database are indicated

From: New insights into genome annotation in Podospora anserina through re-exploiting multiple RNA-seq data

Predicted_CDS

Chromosome

Position

exon

length (aa)

sequence

InterPro

Observation

1

1

4,418,621

2

63

MARKGSNKREIYTRSSLSASLYSWLAGGGGSSGGVVAVDHDADAADDSDDAGDDAGDAGD

GRG

nothing

Ā 

2

1

5,197,730

2

170

MSSTERLLTLAMPLKHEMELMELDVSPKPISVTSSEDNDATAKPHHNEQNRRYSTTNRLD

PQEAEPAALEETSYAQQTTDPETTWSPDGADDEGFHVEDDNECASTLPPSPELEAVEDEE

MAGWVKEQQTSQPSLRLYRGSPGDILVMARTAAPSFPTYYPRELNMDDEC

nothing

Ā 

3

1

5,382,147

3

165

MKTAILLAVLFVGASSLPVAPSLEAKVYLSLSFPTVDATSTRAQWVSSYGNKPPKKAQEN

EPTVDGTSTRGQWVSSHGNNPPKKFQEGPTVDATSTRAQCVSSYGNKPPKKGASFSDHWL

PVGPGYLDLDALESGYHAHLMIMHDSTSWLQSGMTAIKRFMRRFA

Signal peptide, transmembrane domain

Ā 

4

1

5,504,377

1

395

MADLIDLSQQSPSSLSPWEPSVLLQLWNPDVQAWSCLGWTRAERRCRRVLSQAKREATMR

ILPDLGLSGSHDVDFETELLGELSHECLCRYHSDDETAKTLVEQWKTALKKARSQYQKTE

RTTSKESLSTDSVAHHARTQSPPLSSTESEETTEVMLKDHPEDSTVLKQEAIVEQTIEQS

TSASSPRLNSTESASPPAGKTPPKLGFSTPVPGATPSKSPEVSKSTPLFDFTQLFQTPGS

HKPTVNPHKGTPAPAAFKYDQSRTPGTSSTADMSHSSVASLSPNENTPAFVFTSSSTPSR

PPATTEGSLPKSEDPFRYSGSPIREAGQRIIKSLREIGDMEVPNELDGDISGLGQSIERL

RLRLEKGRSLCLSGPDAGSEGDDETSDQDGKRRME

nothing

nothing significant found with BLASTp against NCBI database

5

1

7,676,610

1

212

MPASDITIDLDRSSTPFPRSRRSERAHEQVEDFNRRSSSPRRFTSTPTRSRRLSPSPTRY

RSAATIARSPPRYRSPTRITSAPRPRRISPSPLRHRSPIRLPSSTSRRAPFPSPYFKETR

ITEARRTTTYHPTSPVSRYTETRTTYRTPISRPRTPPRGSIGSPSRRITSPRLVDIRSSS

EIYSSSRRYDSYPSTTRSPGRSTDRSLFTRRY

nothing

Ā 

6

1

7,835,241

3

94

MSGYPPQQGGYYPQAPPQGYPPQGYPPPDGGYPPQGYPPQGYPPPQQQMQYQQAPPPKEE

KSHGCLYTCVAAMCCCWLCGETCECCLECLDCCF

rhodopsin C-terminal tail

nothing significant found with BLASTp against NCBI database

7

2

1,526,413

2

214

MANIKTETADEGVTAADPGAIKKAPFSMTESELREILVLAIDRHPAIHPIVQRHLDRLRD

NNLGGFQDDFEKIRCEVYACASEPCFSDPKAIGACIKRYFEKLLNVATRESPYETRYSAV

EWFLRVLNLLVFTSDPHDVRKEIWSHTDGSCLKLVMLVCRFRTAERGRLLRDHNSLIMLK

MDLITANAKDLPELLREFEPTIHVIKSWRAESRG

nothing

Ā 

8

3

822,817

1

357

MGYEWNGDPTILIVVIACSVCFGWVPIITVVSIVRHCRARLRAKRGSNGTNSDAESQGGR

PSTAPDVPKPLQTYHPSSTKGLERSASSRTRSSADGYDLKRVDTNSSWNPIRHSFHYDNE

SLWGGDGLSRSNSRHRPPYFPTHVHNTTPSLSRPASIRSVASSHRQQSRSRRSSMASNSD

NAPAAFQINDTYYDTTPLPNVTRTVNPVVASSSTPTSSKGPGQAPQQRQQKQPKQDNPHP

PQRNRRTRHSLDARGDSDSLTRDISRPNTSMTRREVEEYEDLDNQKQKATHRSHRPPRPG

SASRRGSHSAPGGSEETDDDLSMAGALPPAKLPPRRASLHAQTFERPAWLHEEPHAM

transmembrane domain

nothing significant found with BLASTp against NCBI database

9

3

1,121,553

2

317

MHDCEFEENPAGFCCAVETVELHAAGRSYFYSSFEGASCYRQDFAFFRNLQHISLRNFFD

DPNRSRQQTVQLLRHSPNLHRLELGLSAKAVVRQLEREGSFGVFVHFFDRLCDEYAESGG

QPLRLTHLGLFDAMWVWKPESLRKKPADLAFLQEVRLNTETIEDCITDNLVDLFDSEALS

GYAVLVETDRGSKYGPAYLVGARELEMRRPRTPMQLAEMSLVLGGTWGNQKLLAATTRHS

LQGLVVNMNRPDPRRSLDFLLAPLQNMHRLARLWIVSANMYKDLPLLTKAAQKGGCRVSC

LALHRDRVALLGGNWQN

nothing

nothing significant found with BLASTp against NCBI database

10

3

3,727,340

4

215

MEPRETYREFGSRAAGRHRRKTLGSTRQTVRDSCKVNGGNDDTSWLRTPPPHCPCQKLPV

TTGLREEEYAGKQTQAREESDGMWVVVERKEGFDRQPGGDGCFQSGERKRGFSGGWRRLS

STTTTTTTTTTATTATTTTTTTTDEEGEHREEQQETEETGGCGGGSKARPHSLGTVEEKK

PNKKMAGDYPEVLRKFSLPLSVQGIGSMGLGFPMP

nothing

Ā 

11

4

2,328,575

2

69

MVISMTQRNIPGIWRSGGGRGQDNSAPLPQLQQQQQQQQQQQQQQQQQQQQQQQQQQQQQ

PQPQPRLQQ

nothing

Ā 

12

4

3,371,922

2

373

MCQGTIYDFWCPCIFHAPSTSFYLQFDIHPPDFNYTFTRRPTTNPLKAHLSKSSHSIVYS

QHCAAYKFCDDYLHSEGFNPGDVFDMGGLCPAGHQVTYEREAFISSRLCDACISGKCEEN

MEFAGVKTVRRSRYGWRSREEEREGKRRSRSRPGRGVSPAGSVRSFDSTGRGRSSSVGST

RTVKGRDMGVEKGGVAGEGEGKTLGAMNLKNLVDKMVQTVSALRVGGGAERQDQPRVMPA

SDLEAMAEESMPTPLPSRHKPSGKNLEDMFDNSGRPEYDSDQDTVVGASKTTEKKSKVNG

KTIAADEISGVMQEIPTGRSKSRKRRMWTDPRTDEEASRVLRFLRRGKGAAPVETGNSRE

RSRGQGYERITIE

nothing

nothing significant found with BLASTp against NCBI database

13

5

484,874

2

126

MQLTRSLSTALVALLLSSIATGHRIPAQSEELQLRDAAPAEVNETGTPPVVLPVDDTLSA

DVIVDETEHGSLVGRAVHPRQLGKGKGGGKGKGGGKGKGGAKGKGGGKGKGGKGKGGKGK

GGKGKG

signal peptide

Ā 

14

5

1,699,000

5

503

MTPDVDKPNRTIPNLQKQLSVEREEKELKEAQYQFRIQELQDEINSLRDNEHESISTGCP

QPEPGTTSVNREDIVVRAMLRGTSPAMLHQEGTIALPLSESPRLSVSHSEDHYEWKDNIT

ALALTSQGEDTPKVAYKVEEGSQNDESDFDEVDYRIPMKGKEKWKAAVTSERYKYREQKD

REYREALNKQHVDGSDRILRMDELVAEGNQPWSTFNMRHTLKATTAHDIPLQSSKSHPVE

SHDVPLSDHDWISGKHPDDPRAEDRLAPEDVDVKLAPLKDDTAMGSVPDLGYGLPRELSI

RPQNESKTDDGNIQEDQSDNQTVYSDDGSIDGDILNVCKTELADSLANHIRQLEVGPEGY

ANITRKLPPLLKAFALRVGIQAHRGCRGMLCSLCTSIATKPASEYNGTSTEALGSRIINW

IQHDEDSDSTGLEQQPSKETPDELPVEVEADGINFLPDDHQGLAHQRNHHSSNFGEHSGV

SWLLALWCETPKLWSFGMVIVAI

nothing

nothing significant found with BLASTp against NCBI database

15

6

2,341,619

2

81

MTEDLHRDITERLRCLELQIRITSHMFIGVAQNAGDDPTNLVKVKDEMLGKLQEMRYEEE

RLARERLAALKQRVPSAGNSD

nothing

Ā 

16

6

2,662,998

1

382

MSAPLLMHPAEPATADNTKPRLACPFFRYDPCRHYACASYELKGFEAVKKHLERKHILKN

HCARCFRSFESEDARNNHIVSECCSIALGRDEITYDEWTRARRCPRTKSCEVKWKWLWTT

FFKLPALPRELVYFQDAVVEAKNVLIDPVTIQSVLKARLHLDQQEISSVADEVREALLRK

NSGARPYRVCDSEGGGDNGIPANLKASGYGSMGGGAAEMEAEAVAFALPPARHALLPEEP

CLPIIGESSPHPAAAVSPVTPLPTSFSLGPILVPQQPASTSGGGPETNTFDAWRTVCLVP

WATADGILARLMEDPISWFKPDGPKWSDVYDHIDRDALRKFWALGNTPAVQVSIPIRSTH

VQSLAAIESKLFDFEVAGIRPS

nothing

Ā 

17

6

2,780,217

1

214

MDTKDEDSAQQQSSPLLPISNHPPSSRPRTPILLKLETNLPLVTPAQPPETTPQETWDYP

TSLRQLTALLLFTLQLLILITYHPSFLSLLPIPGPLSNHHCLLLADTIITCLAIIISSYV

HFCIASLDCELLEQGWKPVYFYIMAADETVILLAAASSGLENVCSWGLFVVTVGSWYVGW

RLGAVEVLSRRLFRAEGWEFGQGEGEEGRGLRVV

transmembrane domain

Ā 

18

6

4,158,427

4

64

MACDSHGRQPSEFALVHEALPRDIHLPTCIHASPKRKTVSSSDTKPRRFLLHTQGVTSGP

RACG

nothing

Ā 

19

7

440,886

8

626

MVEGVRAFDKLDWKDDVAFCSLTEDMEEAVGPGDEVFVCSNQDGMTGSWEMIHNSSSFGA

PPITSELFENANEEPMIDPAVLGDTWSQMKAWATLCGIKDDPIAPGIAELLEIEEQESGD

GGFCCYGTISHAEVKLVGNLAESRDRLLNNEHVQSFAVIKHDDYLMVIFSDNHIFAQVNE

AVSQALTSLFNKFKFFEVKAFAQIGKIQSLFYQSHTPGQAKLRVDINIYGSAADADAVGL

YLGSTAKLYLQDPEYGTENIEYLNRQLIHFPGFEEPKVFAGPGADFANKTSKALQGVRSQ

REHFDQTLSQILLTSRSHHVLVVGRNQKRPQTTLFKAACEIRANFGWCLTATPIQNRLEE

LGSPLAFLPIDQLQNRAMFKKKIMDASSPDAHTMLELPPIEERYHYITLSQEERNRYDKT

AADMSNWINHKTGLHVLTPNSGDDNNDKVDHFDLSGVSSKIEVLIRHLQQTPRDTKRYVG

SARLAEVLENQAYINSPSIVFSCWTKTLDLVALHLTRMKILHQRIDGRQKLAERQHNMSR

FVSDEGTSVPVLLTTTGVGAFGLNLTAANHVYILEPQWNPSVESQALSRVARRGQKKTVL

VTRYLVHGTVEILRKMRLAEAGWATP

transcription facteur SNF2 related, DNA binding domain, ATP binding site

Transcript overlap two features now annotated as pseudogenes

20

7

3,133,570

2

52

MPPKILSEKHEALRQDVNAKMNKFELRINRKVDDHMQLRDMFHDRREATSFS

nothing

Ā