(Part 2) Top products from r/bioinformatics
We found 22 product mentions on r/bioinformatics. We ranked the 73 resulting products by number of redditors who mentioned them. Here are the products ranked 21-40. You can also go back to the previous section.
21. Discrete Mathematics for New Technology, Second Edition
Sentiment score: 1
Number of reviews: 1
Used Book in Good Condition
22. Molecular Biology of the Cell, 5th Edition
Sentiment score: 1
Number of reviews: 1
Used Book in Good Condition
23. The Routledge Handbook of Philosophy of Information (Routledge Handbooks in Philosophy)
Sentiment score: 1
Number of reviews: 1
24. Campbell Biology (11th Edition)
Sentiment score: 1
Number of reviews: 1
This refurbished product is tested and certified to work properly. The product will have minor blemishes and/or light scratches. The refurbishing process includes functionality testing, basic cleaning, inspection, and repackaging. The product ships with all relevant accessories, and may arrive in a ...
26. How to Break Software: A Practical Guide to Testing W/CD
Sentiment score: 1
Number of reviews: 1
27. Life Out of Sequence: A Data-Driven History of Bioinformatics
Sentiment score: 1
Number of reviews: 1
28. Bioinformatics: The Machine Learning Approach, Second Edition (Adaptive Computation and Machine Learning) (Adaptive Computation and Machine Learning series)
Sentiment score: 0
Number of reviews: 1
29. Structure and Interpretation of Computer Programs - 2nd Edition (MIT Electrical Engineering and Computer Science)
Sentiment score: 1
Number of reviews: 1
NewMint ConditionDispatch same day for order received before 12 noonGuaranteed packagingNo quibbles returns
30. Biostatistics for the Biological and Health Sciences
Sentiment score: 0
Number of reviews: 1
TextbookBiostatistics
31. Concepts of Genetics (11th Edition)
Sentiment score: 0
Number of reviews: 1
Concepts of Genetics
32. Statistics for the Life Sciences (5th Edition)
Sentiment score: 0
Number of reviews: 1
33. Math Refresher for Scientists and Engineers
Sentiment score: 1
Number of reviews: 1
34. Biomedical Informatics: Computer Applications in Health Care and Biomedicine (Health Informatics)
Sentiment score: 0
Number of reviews: 1
35. Pattern Recognition and Machine Learning (Information Science and Statistics)
Sentiment score: 1
Number of reviews: 1
Springer
36. Dell Latitude E7240 Business Laptop, 12.5 screen, Intel Core i7-4600U, 8GB DDR3L RAM, 256GB SSD, Windows 10 Professional (Renewed)
Sentiment score: 1
Number of reviews: 1
Memory (4GBx2), 256GB Solid State Drive12.5" Gorilla Glass display802.11ac/a/b/g/n Dual Band Wireless, Bluetooth 4.0, 10/100/1000 Gigabit Ethernet LAN, HDMI, Mini DisplayPort, RJ-45, USB 3.0, Audio, E-Port ConnectorWindows 10 Professional, 4-cell (45WHr) Primary Lithium Ion BatteryIntel Core i7-4600...
37. ggplot2: Elegant Graphics for Data Analysis (Use R!)
Sentiment score: 1
Number of reviews: 1
Used Book in Good Condition
38. Rivals: Conflict As the Fuel of Science
Sentiment score: 0
Number of reviews: 1
Generally speaking, the following are the main considerations when evaluating a computer:
All that being said, without having a very good idea of what your day-to-day computing needs are, and what the heaviest-duty tasks you'll be doing will be, it's hard to know what to recommend.
If you have very good computing infrastructure, and won't need to do any development or heavy-lifting locally, then most of what you'll be doing with your personal computer is browsing the Internet and word processing. For that I honestly recommend a budget option, because it's difficult to tell the difference between a $500 machine and a $2000 machine with those tasks. A middle of the road Chromebook should do just fine, as will something like a Dell Latitude -- which is my personal machine, and I can recommend for both the performance/reliability of the computer itself and the efficacy of the refurbishing process. If you want a little more oomph and have a little more cash, the baseline MacBook Air should do just fine as well.
If you'll need to be doing development locally, and mild-to-moderate compute -- for instance if you're taking computational classes that require you to do homework on your own machine -- the best bang for your buck in a single machine is probably a MacBook Pro.
If you're pretty much on your own for all of your computing needs -- research that'll involve large datasets and/or ML, coursework, etc -- and/or you've got money to burn, I recommend building your own desktop for heavy computing and getting a budget laptop for mobility (remote access can be set up to give you access to your desktop from your laptop). This is where selecting components and knowing what you're doing is imperative, so this recommendation is totally spitballing, but something like this build should be able to tackle anything you can reasonably be expected to do on your own.
Bear in mind again that your needs may vary significantly from the needs that build is designed to address -- and that the price of video cards/GPUs is way inflated at the moment -- but that's a decent skeleton for a high-performance machine, and the walkthrough of setting it up is pretty good.
If you can give me more specifics I can try to refine my recommendations, but without any context I think this post should be a fair guideline as you're shopping for a machine.
Here are some books that I feel have made me better professionally. They tend toward the comp sci side, some are more useful than others.
Lastly, find some open source projects and read their papers, then read their code (and then the paper again, then the code...etc)! Then find their blogs and read those too. Then find them on Twitter and follow them. As others have said, the field is evolving very quickly, so half the battle is information sourcing.
Chapter 7 of Chris Bishop's book Pattern Recognition and Machine Learning has a nice intro to SVMs.
Here is a list of papers where SVMs were used in a computational biology
> Gene Function from microarray expression data
>
> Knowledge-based analysis of microarray gene expression data by using support vector machines, Michael P. S. Brown, William Noble Grundy, David Lin, Nello Cristianini, Charles Walsh Sugnet, Terence S. Furey, Manuel Ares, Jr., David Haussler, Proc. Natl. Acad. Sci. USA, vol. 97, pages 262-267
> pdf
> http://www.pnas.org/cgi/reprint/97/1/262.pdf
>
> Support Vector Machine Classification of Microarray Gene Expression Data, Michael P. S. Brown William Noble Grundy, David Lin, Nello Cristianini, Charles Sugnet, Manuel Ares, Jr., David Haussler
> ps.gz
> http://www.cse.ucsc.edu/research/compbio/genex/genex.ps
>
> Gene functional classification from heterogeneous data Paul Pavlidis, Jason Weston, Jinsong Cai and William Noble Grundy, Proceedings of RECOMB 2001
> pdf
> http://www.cs.columbia.edu/compbio/exp-phylo/exp-phylo.pdf
>
> Cancer Tissue classification
> from microarray expression data, and gene selection:
>
> Support vector machine classification of microarray data, S. Mukherjee, P. Tamayo, J.P. Mesirov, D. Slonim, A. Verri, and T. Poggio, Technical Report 182, AI Memo 1676, CBCL, 1999.
> ps.gz
> PS file here
>
> Support Vector Machine Classification and Validation of Cancer Tissue Samples Using Microarray Expression Data, Terrence S. Furey, Nigel Duffy, Nello Cristianini, David Bednarski, Michel Schummer, and David Haussler, Bioinformatics. 2000, 16(10):906-914.
> pdf
> http://bioinformatics.oupjournals.org/cgi/reprint/16/10/906.pdf
>
> Gene Selection for Cancer Classification using Support Vector Machines, I. Guyon, J. Weston, S. Barnhill and V. Vapnik, Machine Learning 46(1/3): 389-422, January 2002
> pdf
> http://homepages.nyu.edu/~jaw281/genesel.pdf
>
> Molecular classification of multiple tumor types ( C. Yeang, S. Ramaswamy, P. Tamayo, Sayan Mukerjee, R. Rifkin, M Angelo, M. Reich, E. Lander, J. Mesirov, and T. Golub) Intelligent Systems in Molecular Biology
>
> Combining HMM and SVM : the Fisher Kernel
>
> Exploiting generative models in discriminative classifiers, T. Jaakkola and D. Haussler, Preprint, Dept. of Computer Science, Univ. of California, 1998
> ps.gz
> http://www.cse.ucsc.edu/research/ml/papers/Jaakola.ps
>
> A discrimitive framework for detecting remote protein homologies, T. Jaakkola, M. Diekhans, and D. Haussler, Journal of Computational Biology, Vol. 7 No. 1,2 pp. 95-114, (2000)
> ps.gz
> PS file here
>
> Classifying G-Protein Coupled Receptors with Support Vector Machines, Rachel Karchin, Master's Thesis, June 2000
> ps.gz
> PSgz here
>
> The Fisher Kernel for classification of genes
>
> Promoter region-based classification of genes, Paul Pavlidis, Terrence S. Furey, Muriel Liberto, David Haussler and William Noble Grundy, Proceedings of the Pacific Symposium on Biocomputing, January 3-7, 2001. pp. 151-163.
> pdf
> http://www.cs.columbia.edu/~bgrundy/papers/prom-svm.pdf
>
> String Matching Kernels
>
> David Haussler: "Convolution kernels on discrete structures"
> ps.gz
> Chris Watkins: "Dynamic alignment kernels"
> ps.gz
> J.-P. Vert; "Support vector machine prediction of signal peptide cleavage site using a new class of kernels for strings"
> pdf
>
> Translation initiation site recognition in DNA
>
> Engineering support vector machine kernels that recognize translation initiation sites, A. Zien, G. Ratsch, S. Mika, B. Scholkopf, T. Lengauer, and K.-R. Muller, BioInformatics, 16(9):799-807, 2000.
> pdf.gz
> http://bioinformatics.oupjournals.org/cgi/reprint/16/9/799.pdf
>
> Protein fold recognition
>
> Multi-class protein fold recognition using support vector machines and neural networks, Chris Ding and Inna Dubchak, Bioinformatics, 17:349-358, 2001
> ps.gz
> http://www.kernel-machines.org/papers/upload_4192_bioinfo.ps
>
> Support Vector Machines for predicting protein structural class Yu-Dong Cai*1 , Xiao-Jun Liu 2 , Xue-biao Xu 3 and Guo-Ping Zhou 4
> BMC Bioinformatics (2001) 2:3
> http://www.biomedcentral.com/content/pdf/1471-2105-2-3.pdf
>
> The spectrum kernel: A string kernel for SVM protein classification Christina Leslie, Eleazar Eskin and William Stafford Noble Proceedings of the Pacific Symposium on Biocomputing, 2002
> http://www.cs.columbia.edu/~bgrundy/papers/spectrum.html
>
> Protein-protein interactions
>
> Predicting protein-protein interactions from primary structure w, Joel R. Bock and David A. Gough, Bioinformatics 2001 17: 455-460
> pdf
> http://bioinformatics.oupjournals.org/cgi/reprint/17/5/455.pdf
>
> Protein secondary structure prediction
>
> A Novel Method of Protein Secondary Structure Prediction with High Segment Overlap Measure: Support Vector Machine Approach, Sujun Hua and Zhirong Sun, Journal of Molecular Biology, vol. 308 n.2, pages 397-407, April 2001.
>
> Protein Localization
>
>
> Sujun Hua and Zhirong Sun Support vector machine approach for protein subcellular localization prediction Bioinformatics 2001 17: 721-728
>
>
> Various
>
> Rapid discrimination among individual DNA hairpin molecules at single-nucleotide resolution using an ion channel
> Wenonah Vercoutere, Stephen Winters-Hilt, Hugh Olsen, David Deamer, David Haussler, Mark Akeson
> Nature Biotechnology 19, 248 - 252 (01 Mar 2001)
>
> Making the most of microarray data
> Terry Gaasterland, Stefan Bekiranov
> Nature Genetics 24, 204 - 206 (01 Mar 2000)
You won't get more mileage in CS per hour of effort than you will by studying discrete mathematics. You need to know about:
Mathematical logic: logical connectives, truth tables, axioms, common techniques of proof -- especially proof by induction, which comes in handy when you want to prove the correctness of an algorithm
Elementary counting techniques and combinatorics
Sets, functions (you need to be familiar with injections, surjections, and bijections), relations, cardinalities
Graphs and trees (bonus points for implementing them as classes in your favorite language). Eulerian and Hamiltonian paths. Recognize that many computational problems are equivalent to searching for a node or a certain kind of path through a graph.
Those ideas will get you surprisingly far in computer science. Once you're familiar with those things, you'll be able to see where a lot of the other ideas are coming from.
At the very least, you'll have some tools to guide you so that any computer program you hack together won't be completely awful, even if it isn't optimal (it almost never needs to be optimal; it just needs to get the job done in a reasonable amount of time). After that, you can do some linear algebra and probability theory. After probability theory you will be able to think clearly about statistics. I recommend starting with this book, which is great for self-study:
http://www.amazon.com/Discrete-Mathematics-Technology-Second-Edition/dp/0750306521
Move as slowly as you need to. Try to do any of the exercises for which the solutions aren't immediately obvious to you. Even if you only get through a chapter or two, you'll be much better off than you would have been.
I can tell you what I think was the most importent stuff we have been doing so far in my bachelor.
BioChemistry
Cellbiology
IT Basics
Operating Systems
BioinformaticsBasics
So this is just supposed to be some kind of reference you can use to learning. You probably don't need to work through all of this.
But I strongly suggest reading about Biochemistry and Cellbiology(a nice book is Molecular Biology of the Cell) as it is really important for understanding bioinformatics.
Also give the link I posted in the Operating System part a look. Try to just use linux for a month as a lot of bioinformatics applications are written for linux and its nice to see the contrast to windows.
Regarding programming I suggest you search for a book that combines python + bioinformatics(something like this). If you want to focus on the programming part you would ideally start in ASM then switch to C then to Java and then to python.(Just to give you an impression why: ASM gives you a great insight into how the CPU works and how it acesses RAM. C is on a higher level and you start thinking about organising data and defining its structure in RAM. Java adds another layer onto that - you get objects, which make it easy for you to organize your data in blocks and there is no need for you to manage the RAM by hand with pointers like in C. But you still need to tell your variables specifically what they are. So if you have a variable that safes a Text in it you have to declare it as a string. Finally you arrived at python which is a scripting language. There is no more need for you to tell variables what they are - the compiler decides it automatically. All the annoying parts are automated. So your code becomes shorter as you don't need to type as much. The philosophy behind scripting languages is mostly to provide languages that are designed for humans not for machines).But it is kind of a overkill in your situation. Just focus on python. One final thing regarding programming just keep practicing. It is really hard at the beginning but once you get it, it starts making fun to programm as it becomes a creative way of expressing your logic.
Let's get to the bioinforamtics part. I don't think you really need to study this really hard but it's nice to be ahead of your commilitones. I recommand reading this book. You might also check out Rosalind and practice your python on some bioinformatics problems.
Edit: If you want I can send you some books as pdf files if you PM me your email adress
Consider learning some about the history of bioinformatics. This book is great: https://www.amazon.com/Life-Out-Sequence-Data-Driven-Bioinformatics/dp/022608020X
And, I’m always a huge advocate of the coursera courses. Take a look at the genomic data science and/or bioinformatics specializations on there.
Or, maybe even better, there’s this Harvard bioinformatics course that has lectures and materials available online. It’s really great. Some history, some statistics, and labs in R! Here: https://canvas.harvard.edu/courses/49497/pages/course-schedule
There are a few books out there that are "math refreshers", I think I even saw one titled "Math refresher for scientists and engineers"
https://www.amazon.com/Math-Refresher-Scientists-Engineers-Fanchi/dp/0471757152
I haven't personally read it myself but perhaps you could glance through and see if it interests you. It's pretty easy to find a pdf of as well
Bioethics in bioinformatics, especially in a clinical context, is a fairly active area. It can be viewed as a subfield of computing ethics or the ethics of information.. For e.g. see: Ethics in Computing and Information Ethics and Philosophy of Information. As bio-augmentation technologies proliferate, issues surrounding the personal, ethical, legal, and socio-philosophical implications of bioinformation - its generation, use, storage, handling, persistence , ownership - will get quite complex. So, your thoughts may be worth having!
The NIH Bioethics department may be good place to investigate. If you are an experienced professional, just go on and write a paper and ship it off to a journal. If you don't know where to start, put something on paper and find a collaborator that you can work with - they may be found in hospitals, law schools and/or departments of philosophy, social science .... endless options.
NIH Bioethics
Fellowships
> I'd recommend focusing more on basic biology textbooks
This is a really good idea. I would recommend Campbell Biology for general biology at the intro level and Concepts of Genetics for genetics
Yeah, I think it's pretty wild stuff. It just blows my mind that the biochemical network within a cell can be influenced by the emergent properties of the network itself (instead of any physical or chemical properties). The behavior of a network translates across applications - it's weird. Here's what got me into it - it's a pretty approachable read for the topic.
> What kind of testing do you use for data analysis (ie when you don't know what the result should be)? Do you go so far as to make fake data where you know the answer?
Yes. You craft fake data which will yield a known outcome. Then you run the analysis. Then you compare the results of analysis against the expected outcome. This can become tedious, so you automate the testing with scripts or frameworks for that purpose. In general:
Perhaps you are further along in your understanding, but it sounds like you may still be trying to wrap your mind around the entire woolly concept of testing. If that is the case, you might benefit from How to Break Software. Unfortunately, that book is not about data analysis, specifically, but testing analytical code is a special case of testing other kinds of code.
Many, I would recommend this book in particular: http://www.amazon.co.uk/Bioinformatics-Learning-Approach-Adaptive-Computation/dp/026202506X
There's a book about it, called Rivals
\>> Authors getting shafted of major credits
What about Raymond Gosling, Franklin's Phd student who took Photo 51?
My stats introduction was with 'Statistics for the life sciences', by Samuels, Witmer, Schaffner: https://www.amazon.com/Statistics-Life-Sciences-Myra-Samuels/dp/0321989589.
If you want more entry-level ML stuff, go for 'Elements of statistical learning' that /u/gamazeps also linked.
Remembering to distinguish bioinformatics from biomedical informatics, if you are looking for the latter, I'd suggest extending what you already know - this is a good Link to that start. https://www.coursera.org/learn/clinical-data-management
The reference book on biomedical informatics is the Shortliffe text: https://www.amazon.com/Biomedical-Informatics-Computer-Applications-Biomedicine/dp/0387289860
If you have zero stats background, try getting a textbook from a college level intro course (I used this one) and going through the chapters and doing the problems. If you are past the intro level, find a higher level course text.