User:Dennis9600: Difference between revisions

From Eterna Wiki

No edit summary
No edit summary
Line 1: Line 1:
<p><img title="Smile" src="/wiki/extensions/TinyMCE_MW/jscripts/tiny_mce/plugins/emotions/img/smiley-smile.gif" border="0" alt="Smile" /> <strong><span style="font-size: medium;">I have just added Vienna 2.1.1 dot plots to my publications.&nbsp; I wanted to incorporate them into the report files, but it was much more convenient to just make a separate file for each submitted design</span>.<span style="font-size: medium;">&nbsp; Giving each one a unique file name that was acceptable to Python, the Windows operating system, Ghostscript, and Google Drive required a little name mangling</span>.&nbsp; </strong></p>
<p>&nbsp;</p>
<p><img title="Innocent" src="/wiki/extensions/TinyMCE_MW/jscripts/tiny_mce/plugins/emotions/img/smiley-innocent.gif" border="0" alt="Innocent" /><strong><span style="font-size: medium;"> There are a huge number of files this time.&nbsp; I haven't had time to look at everything I uploaded to the ReportsActiveLabs folder.&nbsp; Please PM me if you spot any problems.</span></strong></p>
<p>&nbsp;</p>
<p><span style="font-size: medium;"><strong>Analysis Reports and Spreadsheets produced by my Python script can be found in <a href="https://drive.google.com/folderview?id=0B-rDnoMjFSH2cDFibWt6T19nRzg&amp;usp=sharing">ReportsActiveLabs</a>.&nbsp; You can bookmark this folder in your browser.&nbsp; I have decided to stick with one folder rather than change with each publication cycle.&nbsp; <br /></strong></span></p>
<p><span style="font-size: medium;"><strong>Analysis Reports and Spreadsheets produced by my Python script can be found in <a href="https://drive.google.com/folderview?id=0B-rDnoMjFSH2cDFibWt6T19nRzg&amp;usp=sharing">ReportsActiveLabs</a>.&nbsp; You can bookmark this folder in your browser.&nbsp; I have decided to stick with one folder rather than change with each publication cycle.&nbsp; <br /></strong></span></p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p><span style="font-size: small;"><strong>In this edition:</strong></span></p>
<p><span style="font-size: small;"><strong>In this edition of my analysis reports and spreadsheets (new items are bolded):</strong></span></p>
<p><span style="font-size: small;"><strong>&nbsp; 1) If the target structure contains locked, non-Canonical pairs, the Vienna tools are called with a --nsp option that allows the pairing and assigns an energy of 0 to it.&nbsp; Zero may not</strong></span></p>
<p><span style="font-size: small;"><strong>&nbsp; 1) Every lab has it's own subfolder now</strong></span></p>
<p><span style="font-size: small;"><strong>be the best value to use, but it seems to be better than not doing anything.<br /></strong></span></p>
<p><span style="font-size: small;"><strong>&nbsp; 2) Vienna 2.1.1 dot plots for each submitted design!!!<br /></strong></span></p>
<p><span style="font-size: small;"><strong>&nbsp; 2) Some of the labs now have a starting sequence of 'GG' instead of 'GGAAA'.&nbsp; It looks like the devs have left things open for additional "tails" to be used in future labs.&nbsp; For now, my script</strong></span></p>
<p><span style="font-size: small;">&nbsp; 3) If the target structure contains locked, non-Canonical pairs, the Vienna tools are called with a --nsp option that allows the pairing and assigns an energy of 0 to it.&nbsp; Zero may not</span></p>
<p><span style="font-size: small;"><strong>recognizes both of the sequences that have appeared and uses the correct one for each lab.<br /></strong></span></p>
<p><span style="font-size: small;">be the best value to use, but it seems to be better than not doing anything.<br /></span></p>
<p><span style="font-size: small;">&nbsp;3)&nbsp; There is a field for the Vienna 2.1.1 melt point of each design submitted.&nbsp; This new field now appears in both the text reports and the spreadsheets.<br /></span></p>
<p><span style="font-size: small;">&nbsp; 4) Some of the labs now have a starting sequence of 'GG' instead of 'GGAAA'.&nbsp; It looks like the devs have left things open for additional "tails" to be used in future labs.&nbsp; For now, my script</span></p>
<p><span style="font-size: small;">recognizes both of the sequences that have appeared and uses the correct one for each lab.<strong><br /></strong></span></p>
<p><span style="font-size: small;">&nbsp; 5)&nbsp; There is a field for the Vienna 2.1.1 melt point of each design submitted.&nbsp; This new field now appears in both the text reports and the spreadsheets.<br /></span></p>
<p><span style="font-size: small;">My lab tool includes a forecasting tool that looks at factors that have correlated well with past synthesis scores.&nbsp; The current version of the forecasting tool looks at the following factors:</span></p>
<p><span style="font-size: small;">My lab tool includes a forecasting tool that looks at factors that have correlated well with past synthesis scores.&nbsp; The current version of the forecasting tool looks at the following factors:</span></p>
<p><span style="font-size: small;">&nbsp;&nbsp;&nbsp; a) Whether or not the design folded correctly in EteRNA's energy model (Vienna 1.8.5)</span></p>
<p><span style="font-size: small;">&nbsp;&nbsp;&nbsp; a) Whether or not the design folded correctly in EteRNA's energy model (Vienna 1.8.5)</span></p>
Line 19: Line 25:
<p>&nbsp;</p>
<p>&nbsp;</p>
<p><strong><span style="font-size: medium;">Things to do:</span></strong></p>
<p><strong><span style="font-size: medium;">Things to do:</span></strong></p>
<p><span style="font-size: large;">1. I am (still) thinking about how to add Vienna 2.1.1 and EteRNA melt curves and dot plots to these reports.&nbsp; I can modify my scripts to generate these, but they are in Postscript format.&nbsp; I am investigating Ghostscript and ImagMagick as possible tools to incorporate into my scripts to convert the graphics.&nbsp; I've been informed that EteRNA dot plots aren't strictly the same as Vienna 1.8.5 dot plots.&nbsp; I will need to incorporate the modifications that EteRNA made to Vienna 1.8.5 dot plots to get it right.</span></p>
<p><span style="font-size: large;">1. I am (still) thinking about how to add Vienna 2.1.1 melt curves&nbsp; to these reports.&nbsp;&nbsp;</span></p>
<p><span style="font-size: large;">2. As of 11/10/2013, a new round of synthesis results is available.&nbsp; I need to run the training portion of my forecasting tool on the new data.</span></p>
<p><span style="font-size: large;">2. Investigate what changes EteRNA made to Vienna 1.8.5 dot plots.</span></p>
<p><span style="font-size: large;">3. Investigate Vienna's "ensemble diversity measure" as another possible factor for the forecasting tool to consider.</span></p>
<p><span style="font-size: large;">2. As of 11/10/2013, about three new round of synthesis results are available.&nbsp; I need to run the training portion of my forecasting tool on the new data.</span></p>
<p><span style="font-size: large;">3. Investigate Vienna's "ensemble diversity measure" as another possible factor for the forecasting tool to consider.&nbsp; I'm capturing the diversity measure now,</span></p>
<p><span style="font-size: large;">but not doing anything with it yet.<br /></span></p>
<p><span style="font-size: large;">4. Look for published energy parameters for non-Canonical pairs in RNA sequences.&nbsp; There's got to be something better than 0 out there.</span></p>
<p><span style="font-size: large;">4. Look for published energy parameters for non-Canonical pairs in RNA sequences.&nbsp; There's got to be something better than 0 out there.</span></p>
<p><span style="font-size: large;">5. Look at "upgrading" to Vienna 2.1.3 (from Vienna 2.1.1) in my toolset.<br /></span></p>
<p><span style="font-size: large;">5. Look at "upgrading" to Vienna 2.1.3 (from Vienna 2.1.1) in my toolset.<br /></span></p>

Revision as of 04:15, 22 November 2013

<img title="Smile" src="/wiki/extensions/TinyMCE_MW/jscripts/tiny_mce/plugins/emotions/img/smiley-smile.gif" border="0" alt="Smile" /> I have just added Vienna 2.1.1 dot plots to my publications.  I wanted to incorporate them into the report files, but it was much more convenient to just make a separate file for each submitted design.  Giving each one a unique file name that was acceptable to Python, the Windows operating system, Ghostscript, and Google Drive required a little name mangling

 

<img title="Innocent" src="/wiki/extensions/TinyMCE_MW/jscripts/tiny_mce/plugins/emotions/img/smiley-innocent.gif" border="0" alt="Innocent" /> There are a huge number of files this time.  I haven't had time to look at everything I uploaded to the ReportsActiveLabs folder.  Please PM me if you spot any problems.

 

Analysis Reports and Spreadsheets produced by my Python script can be found in <a href="https://drive.google.com/folderview?id=0B-rDnoMjFSH2cDFibWt6T19nRzg&usp=sharing">ReportsActiveLabs</a>.  You can bookmark this folder in your browser.  I have decided to stick with one folder rather than change with each publication cycle. 

 

In this edition of my analysis reports and spreadsheets (new items are bolded):

  1) Every lab has it's own subfolder now

  2) Vienna 2.1.1 dot plots for each submitted design!!!

  3) If the target structure contains locked, non-Canonical pairs, the Vienna tools are called with a --nsp option that allows the pairing and assigns an energy of 0 to it.  Zero may not

be the best value to use, but it seems to be better than not doing anything.

  4) Some of the labs now have a starting sequence of 'GG' instead of 'GGAAA'.  It looks like the devs have left things open for additional "tails" to be used in future labs.  For now, my script

recognizes both of the sequences that have appeared and uses the correct one for each lab.

  5)  There is a field for the Vienna 2.1.1 melt point of each design submitted.  This new field now appears in both the text reports and the spreadsheets.

My lab tool includes a forecasting tool that looks at factors that have correlated well with past synthesis scores.  The current version of the forecasting tool looks at the following factors:

    a) Whether or not the design folded correctly in EteRNA's energy model (Vienna 1.8.5)

    b) Whether or not the design folded correctly in the Vienna 2.1.1 Energy Model

    c) The melt point of the design (as reported by the EteRNA server)

    d) The percentages of C,U, and G in the design, and how far they differ from 13, 10, and 21% respectively (the Berex Strategy).

    e) log10(designer's EteRNA points).  This factor is given a very low weight.

 

Things I have looked at:

I have also looked at the temperature setting of the energy model as a possible factors for my forecasting tool.  I was suprised to find that the default setting of 37C is the best setting for my forecasting tool.

 

 

Things to do:

1. I am (still) thinking about how to add Vienna 2.1.1 melt curves  to these reports.  

2. Investigate what changes EteRNA made to Vienna 1.8.5 dot plots.

2. As of 11/10/2013, about three new round of synthesis results are available.  I need to run the training portion of my forecasting tool on the new data.

3. Investigate Vienna's "ensemble diversity measure" as another possible factor for the forecasting tool to consider.  I'm capturing the diversity measure now,

but not doing anything with it yet.

4. Look for published energy parameters for non-Canonical pairs in RNA sequences.  There's got to be something better than 0 out there.

5. Look at "upgrading" to Vienna 2.1.3 (from Vienna 2.1.1) in my toolset.

 

Please let me know if you have any other things you would like to see in my lab reports.

 

Happy folding,

---Dennis9600