Skill Sets
Machine Learning Skills 
| Machine Learning | Deep Learning | Computer Vision |
|---|---|---|
| Linear Regression, Logistic Regression, Naïve Bayes, KNN, Support Vector Machine, Decision tree, Random forest, Bagging,Boosting, Clustering, PCA, Recommender Systems | Artificial Neural Networks (ANN), Recurrent neural network (RNN), Transfer learning | Convoluated Neural Network (CNN), Object detection, Image segmentation |
| Natural Language processing | ML Libraries | Statistical Techniques |
| TF-IDF, Word Embeddings, Long short-term memory network (LSTM), Sentiment Analysis | NumPy, Scikit-Learn, TensorFlow, Keras, OpenCV, NLTK | Hypothesis Testing, Exploratory Data Analysis, Outlier Detection, Imputations, Feature Engineering |
Computing Skills 
| Programming Languages | Visualization Techniques | Database Language |
|---|---|---|
| Python, R/Bioconductor, Unix/Shell | Seaborn, Pandas, Matplotlib, Plotly Dash | SQL |
| Version control | Cloud Computing | Containerization |
| Git, GitHub | AWS EC2, ECR, S3, RDS, SageMaker, Lambda, Autoscaling, CloudWatch, SNS | Docker |
Bioinformatics Skills 
| Next Generation Sequence Data Analysis |
|---|
| Quality Control: FastQC, Samtools, Cutadapt, Bedtools, Trimommatic, PICARD |
| De novo genome assembly: Abyss, Platanus, SOAP denovo, MASURCA, SPAdes, metaSPADES, Trinity, GapClosure |
| Variant calling: GATK, VarScan, mpileup |
| Single-cell analysis: Cell Ranger, Seurat |
| Methylation data analysis: HOMER, MAC, BISMARK |
| Metagenomics data analysis: QIIME, MG-RAST, UCLUST, UPARSE, VSEARCH, METAGENassitsit, MEGAN |
| Plasmid detection: Recycler, SPADES, plasmidSPADES |
| Annotation Tools: Prodigal, Augustus, RepeatMasker, GO , KASS, KEGG, COG Pathway analysis, SSR, Pfam |
| Visualization Tools: IGV genome browser, Circos, FigTree, MEME, GenomeVx, Brig, CONTIGuator |