BIOS823: Biomedical Big Data¶
Notebooks:
- S01 Relational Databases
- S02 RDBMS and SQL
- MongoDB
- Redis
- Neo4j
- Setting up NoSQL servers on Mac
- Scalars
- Vectors
- Matrices
- Sparse Matrices
- Working with Matrices
- Solving Linear Equations
- Linear least squares
- Solving Linear Systems
- Dimension Reduction
- Nonlinear Dimension Reduction
- Anomaly Detection
- Clustering
- Understanding the SVD
- Applications of SVD
- Finding Roots of Equations
- Numerical Optimization
- Algorithms for Optimization and Root Finding for Multivariate Problems
- Using optimization routines from
scipy
andstatsmodels
- Solving Optimization Problems Computationally
- Line search in gradient and Newton directions
- Least squares optimization
- Gradient Descent Optimizations
- Constrained Optimization
- Introduction to C
- Introduction to C++
- Hello world
- Type conversions
- Command line inputs
- Functions
- Header, implementation and driver files
- Using
make
- A more flexible Makefile
- Input and output
- Arrays
- Loops
- Function arguments
- Anonymous functions
- Function pointers
- Generic programming with templates
- Standard template library (STL)
- STL algorithms
- Random numbers
- Numerics
- Native code compilation
- Just-in-time compilation (JIT)
- Cython
- Using
pybind11
- Modules and Packaging
- Parallel Programming
- Multi-Core Parallelism
- Vanilla Python
- Using
numba
to speed up computation - Using
cython
to speed up computation - The
concurrent.futures
module - Using processes in parallel with
ProcessPoolExecutor
- Using processes in parallel with ThreadPoolExecutor
- Turning off the GIL in
cython
- Using processes in parallel with
ThreadPoolExecutor
andnogil
- Using
multiprocessing
- Common issues with use of shared memory in parallel programs
- Parallel Programming Example
- Using
ipyparallel
- Enhancing performance
- Text Analysis
- Text Analysis
- Time Series Analysis 1
- Time Series Analysis II
Homework:
- Assignment 1 Review
- Assignment 2: Data formats
- Assignment 3: Working with Relational Databases and SQL
- Assignment 4: Linear Algebra Review
- Assignment 5: Unsupervised Learning
- Assignment 6: Topic Modeling with Latent Semantic Analysis
- Assignment 7: Conjugate Gradient Descent
- Assignment 8: Supervised Learning
- Assignment 9: Improving performance
- Assignment 10: Review