Data Science

Expertise in the following:

  • Inferential and Descriptive Statistics
  • Regression: Linear, Multiple, shrinkage methods, subset selection methods, ridge, lasso, Tree-based methods, etc.
  • Classification: Random Forests & tree-based methods, Bayes classifier, KNN, Logistic, LDA, QDA, etc.
  • Artificial Neural Networks using R and Python
  • Resampling methods (CV, LOOCV, Bootstrapping, etc.)
  • Unsupervised techniques: Clustering methods incl. K-Means, Hierarchical, etc.
  • Dimensionality reduction methods incl. PCA
  • Social network analysis: Influence maximization methods (Independent Cascade, Linear Threshold, Simulated based algorithms CELF, etc.)
  • Graph algorithms (BFS, k-way, Shortest Path, Spectral Bisection, etc.)
  • Centrality algorithms (Degree, Closeness, Betweenness, PageRank, etc.)
  • Community detection algorithms (Clustering Coefficient, Connected Components, etc.)
  • Other skills: data visualization (Tableau), Databricks, Spark, Scala
  • Intermediate expertise in: Non-Linear methods (splines, etc.), General Additive Models
  • Beginner expertise in: Deep learning, Reinforcement learning