{"id":22,"date":"2025-06-04T14:11:24","date_gmt":"2025-06-04T20:11:24","guid":{"rendered":"https:\/\/courses.cs.colostate.edu\/cs003\/?page_id=22"},"modified":"2026-01-12T14:51:49","modified_gmt":"2026-01-12T21:51:49","slug":"syllabus","status":"publish","type":"page","link":"https:\/\/courses.cs.colostate.edu\/cs525\/syllabus\/","title":{"rendered":"Syllabus"},"content":{"rendered":"\n<h2 class=\"wp-block-heading\">Course description<\/h2>\n\n\n\n<p>Modern biological techniques are generating a variety of large-scale data, and require sophisticated algorithms for their analysis. Tasks include assembling the genomic sequence, predicting the protein-coding regions, their function and the manner in which they are regulated. The course will provide a broad overview of computational techniques currently used in bioinformatics with a focus on deep learning models for protein and DNA sequence data. Students completing the course will be able to analyze biological data, a skill they will develop through the hands-on lab component, which will consist of algorithm implementation and biological case studies using those algorithms. In addition to modern deep learning models for genomics data we will cover dynamic programming for sequence alignment, probabilistic methods for motif finding, and efficient string algorithms for read mapping. No previous biology background is assumed. Programming assignments will be in Python using modern machine learning libraries (PyTorch and scikit-learn). The class project will provide a more open-ended opportunity to apply these tools to realistic problems in genomics.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Prerequisites<\/h2>\n\n\n\n<p>(CS 320 or BZ 360) and CS 345.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Optional Textbook<\/h2>\n\n\n\n<p><a href=\"https:\/\/www.bioinformaticsalgorithms.org\/\">Bioinformatics algorithms<\/a>&nbsp;by Philippe Compeau and Pavel Pevzner.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Grading<\/h2>\n\n\n\n<p>Your grade for this course will be based on the following components:<\/p>\n\n\n\n<figure class=\"wp-block-table\"><table class=\"has-fixed-layout\"><thead><tr><th>Content<\/th><th class=\"has-text-align-left\" data-align=\"left\">Percentage of Grade<\/th><\/tr><\/thead><tbody><tr><td>Homework assignments (3)<\/td><td class=\"has-text-align-left\" data-align=\"left\">25%<\/td><\/tr><tr><td>Project<\/td><td class=\"has-text-align-left\" data-align=\"left\">25%<\/td><\/tr><tr><td>Exercises<\/td><td class=\"has-text-align-left\" data-align=\"left\">10%<\/td><\/tr><tr><td>Reading assignments<\/td><td class=\"has-text-align-left\" data-align=\"left\">  5%<\/td><\/tr><tr><td> Exams (midterm + final)<\/td><td class=\"has-text-align-left\" data-align=\"left\">35%<\/td><\/tr><\/tbody><\/table><\/figure>\n\n\n\n<p>We may tweak these percentages if needed.  We will use the following standard grading scale.  Threshold may be adjusted, but will not be increased (e.g. 90 guarantees an A).<\/p>\n\n\n\n<figure class=\"wp-block-table\"><table class=\"has-fixed-layout\"><thead><tr><th>Score<\/th><th class=\"has-text-align-left\" data-align=\"left\">Grade<\/th><\/tr><\/thead><tbody><tr><td>90-100<\/td><td class=\"has-text-align-left\" data-align=\"left\">A<\/td><\/tr><tr><td>80-89.9<\/td><td class=\"has-text-align-left\" data-align=\"left\">B<\/td><\/tr><tr><td>70-79.9<\/td><td class=\"has-text-align-left\" data-align=\"left\">C<\/td><\/tr><tr><td>60-69.9<\/td><td class=\"has-text-align-left\" data-align=\"left\">D<\/td><\/tr><tr><td>0-59.9<\/td><td class=\"has-text-align-left\" data-align=\"left\">F<\/td><\/tr><\/tbody><\/table><\/figure>\n\n\n\n<p><br><\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Final exam<\/h2>\n\n\n\n<p>The final exam will be held at the regular scheduled time according to the&nbsp;<a href=\"https:\/\/registrar.colostate.edu\/final-exams\/\">registrar&#8217;s office<\/a>.<br><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Course description Modern biological techniques are generating a variety of large-scale data, and require sophisticated algorithms for their analysis. Tasks include assembling the genomic sequence, predicting the protein-coding regions, their function and the manner in which they are regulated. The course will provide a broad overview of computational techniques currently used in bioinformatics with a [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"template-fullwidth.php","meta":{"_kad_blocks_custom_css":"","_kad_blocks_head_custom_js":"","_kad_blocks_body_custom_js":"","_kad_blocks_footer_custom_js":"","footnotes":""},"class_list":["post-22","page","type-page","status-publish","hentry","post-preview"],"taxonomy_info":[],"featured_image_src_large":false,"author_info":{"display_name":"admin","author_link":"https:\/\/courses.cs.colostate.edu\/cs525\/author\/admin_41g0qmxe\/"},"comment_info":0,"_links":{"self":[{"href":"https:\/\/courses.cs.colostate.edu\/cs525\/wp-json\/wp\/v2\/pages\/22","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/courses.cs.colostate.edu\/cs525\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/courses.cs.colostate.edu\/cs525\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/courses.cs.colostate.edu\/cs525\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/courses.cs.colostate.edu\/cs525\/wp-json\/wp\/v2\/comments?post=22"}],"version-history":[{"count":6,"href":"https:\/\/courses.cs.colostate.edu\/cs525\/wp-json\/wp\/v2\/pages\/22\/revisions"}],"predecessor-version":[{"id":82,"href":"https:\/\/courses.cs.colostate.edu\/cs525\/wp-json\/wp\/v2\/pages\/22\/revisions\/82"}],"wp:attachment":[{"href":"https:\/\/courses.cs.colostate.edu\/cs525\/wp-json\/wp\/v2\/media?parent=22"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}