# Statistics

There have been 2 completed talks and 7 topic suggestions tagged with **statistics**.

## Related Tags

- information theory
- data science
- probability
- Tech Talks
- first year friendly
- markov chain
- approximation
- tutorial

## Completed Talks

### Markov Chain Monte Carlo

Delivered by Jacob Jackson on Wednesday October 4, 2017

The talk will introduce Markov chain Monte Carlo methods as a means of sampling from a distribution. The Metropolis-Hastings algorithm will be discussed as well as applications of Markov chain Monte Carlo for Bayesian inference and optimization.

Familiarity with basic probability theory, especially conditional probability, is expected.

The slides for this talk are available at Jacob Jackson's website.
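
As a rough illustration of the Metropolis-Hastings algorithm named in the abstract, here is a minimal sketch, not taken from the talk or its slides. It assumes a one-dimensional target, a symmetric Gaussian random-walk proposal, and a standard normal target density; the function names, step size, and burn-in length are all illustrative choices.

```python
import math
import random


def metropolis_hastings(log_density, initial, n_samples, step_size=1.0, seed=0):
    """Random-walk Metropolis-Hastings with a symmetric Gaussian proposal."""
    rng = random.Random(seed)
    current = initial
    current_logp = log_density(current)
    samples = []
    for _ in range(n_samples):
        proposal = current + rng.gauss(0.0, step_size)
        proposal_logp = log_density(proposal)
        # The proposal is symmetric, so the acceptance ratio reduces to
        # p(proposal) / p(current), computed here on the log scale.
        log_ratio = proposal_logp - current_logp
        if log_ratio >= 0 or rng.random() < math.exp(log_ratio):
            current, current_logp = proposal, proposal_logp
        # A rejected proposal repeats the current state in the chain.
        samples.append(current)
    return samples


def standard_normal_log_density(x):
    # Unnormalized log density: the normalizing constant cancels in the ratio.
    return -0.5 * x * x


samples = metropolis_hastings(standard_normal_log_density, initial=0.0, n_samples=20000)
kept = samples[2000:]  # discard an (arbitrarily chosen) burn-in period
sample_mean = sum(kept) / len(kept)
sample_var = sum((x - sample_mean) ** 2 for x in kept) / len(kept)
```

Only ratios of the target density are used, which is why the method suits Bayesian posteriors whose normalizing constant is unknown.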

### Information Theory

Delivered by Sidhant Saraogi on Friday October 14, 2016

I will try to provide a brief introduction to information theory, working towards motivating Shannon's Source Coding Theorem. We will use simple examples (e.g. repetition codes) to explain the idea of noisy channels, and similarly simple examples to explain the idea behind the theorem. If we have the time, we will try to prove it for a specific example.
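
The repetition-code example mentioned in the abstract can be sketched as follows. This is an illustrative sketch rather than material from the talk: it assumes a binary symmetric channel that flips each bit independently, a 3-fold repetition code, and majority-vote decoding. The function names and the flip probability of 0.1 are assumptions.

```python
import random


def encode(bits, n=3):
    """Repeat every bit n times."""
    return [b for b in bits for _ in range(n)]


def binary_symmetric_channel(bits, flip_prob, rng):
    """Flip each bit independently with probability flip_prob."""
    return [b ^ 1 if rng.random() < flip_prob else b for b in bits]


def decode(received, n=3):
    """Majority vote over each block of n received bits."""
    return [1 if sum(received[i:i + n]) > n // 2 else 0
            for i in range(0, len(received), n)]


rng = random.Random(0)
message = [rng.randint(0, 1) for _ in range(10000)]
flip_prob = 0.1  # assumed channel noise level

# Send the message once without coding and once with the repetition code.
uncoded = binary_symmetric_channel(message, flip_prob, rng)
coded = decode(binary_symmetric_channel(encode(message), flip_prob, rng))

uncoded_error_rate = sum(a != b for a, b in zip(message, uncoded)) / len(message)
coded_error_rate = sum(a != b for a, b in zip(message, coded)) / len(message)
```

With flip probability p, a block of three is decoded wrongly only when at least two bits flip, which happens with probability 3p²(1 − p) + p³, or 0.028 for p = 0.1. The price is that three channel bits are sent for every message bit.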

## Talk Suggestions

### Complex Event Processing Systems

The ever-increasing amount of information that needs to be processed has led to the development of Complex Event Processing systems such as Apache Storm or Twitter Heron. These systems distribute a workload over many machines in a cluster, and offer both efficiency and fault-tolerance.

Possible reference materials for this topic include:

Kulkarni, S., Bhagat, N., Fu, M., Kedigehalli, V., Kellogg, C., Mittal, S., ... & Taneja, S. (2015, May). Twitter heron: Stream processing at scale. In Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data (pp. 239-250). ACM. doi:10.1145/2723372.2742788

Tags: Tech Talks, computer science, data science, distributed system, first year friendly, parallel computing, statistics

### Compressed Sensing

Compressed sensing aims to reduce the number of measurements that sensors gather and store by exploiting the sparsity of the signal, which reduces the need to compress the data later for transmission. This can reduce costs for certain applications, such as non-visible wavelength cameras.

Tags: algebra, information theory, linear algebra, signal processing, statistics

### Dealing with Missing Data

Data are rarely perfect. Robust data science tools must have ways to deal with missing data. However, this is not always easy. A balance must be struck between performance and convenience.
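
To make the trade-off concrete, here is a small sketch comparing two common ways of handling gaps: dropping incomplete entries and mean imputation. The readings are made-up values used only for illustration, and the choice of these two strategies is an assumption, not a recommendation from the topic description.

```python
from statistics import mean, pvariance

# Hypothetical sensor readings; None marks a missing value.
readings = [4.0, None, 5.5, 6.5, None, 4.0]

# Option 1: listwise deletion. Simple, but a third of the data is thrown away.
observed = [x for x in readings if x is not None]

# Option 2: mean imputation. Keeps every position, but the filled-in values
# sit exactly at the mean, so the spread of the data is understated.
fill = mean(observed)
imputed = [fill if x is None else x for x in readings]

variance_observed = pvariance(observed)
variance_imputed = pvariance(imputed)
```

Both options leave the mean unchanged here, yet the imputed series has a smaller variance than the observed values alone, which is one way a convenient fix can quietly distort later analysis.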

Tags: Tech Talks, computer science, data science, efficiency, first year friendly, statistics

### Entropy in Mathematics and Information Theory

Tags: information theory, probability, statistics

### Fermi Estimation

Physicist Enrico Fermi was known for his impressive order-of-magnitude estimation. He was famously able to estimate the yield of the Trinity test atomic bomb to about a factor of two. Why is Fermi estimation so useful and accurate? What are the techniques to make better approximations? How can and do scientists, mathematicians, engineers, and others use Fermi estimation in their work?
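
One common technique is to split the unknown quantity into factors that are easier to guess, and to take the geometric mean of a plausible lower and upper bound for each factor. The sketch below applies this to the classic question of how many piano tuners work in a large city. Every range in it is an illustrative guess rather than measured data, and the variable names are hypothetical.

```python
import math


def geometric_mean(low, high):
    """Midpoint of a range on a log scale, a common way to pick a Fermi guess."""
    return math.sqrt(low * high)


# Every range below is an illustrative guess, not measured data.
population = geometric_mean(1e6, 1e7)                   # people in the city
people_per_household = geometric_mean(1, 4)             # people sharing a home
households_per_piano = geometric_mean(5, 50)            # homes per piano
tunings_per_piano_per_year = 1                          # assumed once a year
tunings_per_tuner_per_year = geometric_mean(500, 2000)  # workload of one tuner

pianos = population / people_per_household / households_per_piano
tuners = pianos * tunings_per_piano_per_year / tunings_per_tuner_per_year
order_of_magnitude = round(math.log10(tuners))
```

With these guesses the estimate comes out at roughly 100 tuners. Individual factors may be off by a factor of two or three, but overestimates and underestimates tend to partly cancel when multiplied, which is one reason such estimates often land within an order of magnitude.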

Tags: approximation, physics, statistics

### Jupyter Notebooks

Jupyter Notebooks are a widely used tool among data scientists and engineers. They support a wide variety of programming languages, most notably Python.

Tags: Tech Talks, data science, first year friendly, statistics, tutorial

### Optimal Stopping Theory and the Secretary Problem
