Data Visualization Part 2: Student Project

The final project consists of a visualization exercise and a text exercise. For this project you need to complete the following parts:

1: Visualization Exercise

Create three alternative visualizations of the same artificial data. All plots display fictitious genomic annotations together with fictitious RNA binding protein data. Each visualization should display the RNA binding protein signals and link this binding information to the genomic annotations to shed light on potential biological function. A reference plot is shown for each version. Two datasets are provided:

The file 10_project_data_annotations.csv contains fictitious genomic annotations, which are visualized in the bottom panel of every example plot. In these panels, each horizontal line represents a transcript. A transcript can contain multiple exons (grey rectangles). Transcripts can be located on the '+' or on the '-' strand of the DNA.
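
As a starting point, the annotation table could be turned into drawable geometry along the following lines. This is only a sketch: the column names (transcript, strand, exon_start, exon_end) and the sample rows are assumptions made for illustration and may not match the actual file, so adapt them after inspecting the CSV header. The function groups exon rows by transcript and returns, per transcript, the overall span (the horizontal line) and the exon intervals (the rectangles).

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample rows; the real file's columns may be named differently.
SAMPLE = """transcript,strand,exon_start,exon_end
T1,+,100,200
T1,+,300,450
T2,-,150,260
"""

def load_transcripts(handle):
    """Group exon rows by transcript and return one record per transcript."""
    exons = defaultdict(list)
    strands = {}
    for row in csv.DictReader(handle):
        name = row["transcript"]
        exons[name].append((int(row["exon_start"]), int(row["exon_end"])))
        strands[name] = row["strand"]

    transcripts = []
    for name, spans in exons.items():
        spans.sort()
        transcripts.append({
            "name": name,
            "strand": strands[name],
            # The transcript line runs from the first exon start to the last exon end.
            "start": spans[0][0],
            "end": max(end for _, end in spans),
            # Each (start, end) pair becomes one grey rectangle.
            "exons": spans,
        })
    return transcripts

transcripts = load_transcripts(io.StringIO(SAMPLE))
```

To read the real file, the same function can be called with an opened file handle instead of the in-memory sample. The resulting records can then be passed to whichever plotting library you choose.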

10_project_data_signals.csv contains fictitious signals of four RNA binding proteins (P1, P2, P3, P4).
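
One possible way to link binding signals to annotations is a simple interval check: a signal is associated with every transcript whose span contains its position. The sketch below assumes that each signal has a protein name, a genomic position and a value, and that each transcript is given as (name, start, end). These structures and all values are hypothetical and are not taken from the provided files.

```python
def overlapping_transcripts(position, transcripts):
    """Return names of all transcripts whose span contains the position."""
    return [name for name, start, end in transcripts if start <= position <= end]

# Hypothetical transcripts as (name, start, end).
transcripts = [("T1", 100, 450), ("T2", 150, 260)]

# Hypothetical signals as (protein, position, value).
signals = [("P1", 120, 3.5), ("P2", 200, 1.2), ("P3", 500, 0.8)]

# Map each signal to the transcripts it falls into.
links = {
    (protein, position): overlapping_transcripts(position, transcripts)
    for protein, position, _value in signals
}
```

A signal that falls outside every transcript, such as the P3 entry here, maps to an empty list. Such signals may still be worth showing in the plot as intergenic binding.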

1.1: Version 1

1.2: Version 2

1.3: Version 3

1.4: Discussion

Discuss the pros and cons of the different visualization approaches.

2: Text Exercise

Summarize the article 'Pencil and paper' by Wong & Kjaergaard. What are the key points? Do not copy and paste from the article. Summarize in your own words.