The MATLAB Bioinformatics Toolbox includes modules that let you read data from FASTA, SAM, CEL, and CDF files. You can also access information from GenBank and the NCBI Gene Expression Omnibus. The program can open these files and present the data in them as visuals (sequence browsers, spatial heatmaps, and clustergrams). It also has "statistical techniques for detecting peaks, imputing values for missing data, and selecting features."
After reviewing the overview of the Bioinformatics Toolbox for MATLAB, I have concluded that it would not help me right now. The primary purpose of the program is to let scientists input their data, write code for how they'd like the data organized, and then analyze it. This helps them reach conclusions faster than they could by looking at completely unorganized data, and it also helps them detect trends. I am not trying to analyze data at this point. I am trying to collect all of the proteins' functions and put them in one place. Later on, once I have designed the website, I could use MATLAB to analyze the data I've collected and come to a conclusion myself. Ideally, though, scientists would take the data I've compiled, organize it themselves, and come to their own conclusions. I am just the mediator: I give them access to all the data in one place, and they analyze it.
What I take from this: I need to include each protein sequence in a FASTA file so that researchers who do want to put my data into MATLAB and analyze the trends can do so.
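To make that concrete, here is a rough sketch of how the sequences could be written out in FASTA format. Python is just my assumption for the language, and the function name, the identifier, the description, and the sequence are all made-up placeholders, not real entries from my collection. The format itself is simple: a header line starting with ">" followed by the sequence, usually wrapped at a fixed line width.

```python
def to_fasta(records, width=60):
    """Format (identifier, description, sequence) tuples as FASTA text.

    Hypothetical helper for illustration; 'width' is the number of
    residues per line (60 is a common convention, not a requirement).
    """
    lines = []
    for identifier, description, sequence in records:
        # Header line: '>' followed by the identifier and an optional description.
        lines.append(f">{identifier} {description}".rstrip())
        # Split the sequence into fixed-width chunks, one chunk per line.
        for start in range(0, len(sequence), width):
            lines.append(sequence[start:start + width])
    return "\n".join(lines) + "\n"


def write_fasta(path, records, width=60):
    """Write the formatted records to a file at 'path'."""
    with open(path, "w") as handle:
        handle.write(to_fasta(records, width))


# Placeholder entry only; the identifier and sequence are invented.
example_records = [
    ("example_protein_1", "placeholder entry", "MKTAYIAKQRQISFVKSHFSRQ"),
]
example_text = to_fasta(example_records)
```

Calling `write_fasta("proteins.fasta", example_records)` would save the file (the file name is also just an example). A researcher with the Bioinformatics Toolbox should then be able to load it in MATLAB with `fastaread`, which returns the header and sequence for each entry.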