Statistics

There have been 2 completed talks and 7 topic suggestions tagged with statistics.

Related Tags

Completed Talks

Markov Chain Monte Carlo

Delivered by Jacob Jackson on Wednesday October 4, 2017

The talk will introduce Markov chain Monte Carlo methods as a means of sampling from a distribution. The Metropolis-Hastings algorithm will be discussed as well as applications of Markov chain Monte Carlo for Bayesian inference and optimization.

Will expect familiarity with basic probability theory, especially conditional probability.

The slides for this talk are available at Jacob Jackson’s website.

Information Theory

Delivered by Sidhant Saraogi on Friday October 14, 2016

I will try to provide a brief introduction to Information Theory working towards motivating Shannon's Source Coding Theorem. We will use rather simple examples (for e.g. Repetition Codes) to explain the idea of noisy channels and similarly simple examples to explain the idea behind the theorem and eventually try to prove it for a rather specific example. (if we have the time !)

Talk Suggestions

Complex Event Processing Systems

The ever-increasing amount of information that needs to be processed has led to the development of Complex Event Processing systems such as Apache Storm or Twitter Heron. These systems distribute a workload over many machines in a cluster, and offer both efficiency and fault-tolerance.

Possible reference materials for this topic include

Quick links: Google search, arXiv.org search, propose to present a talk

Tech Talks computer science data science distributed system first year friendly parallel computing statistics

Compressed Sensing

Compressed sensing is about minimizing the information gathered and stored by sensors, reducing the need for file compression later on for transmission. This can reduce costs for certain applications, such as non-visible wavelength cameras.

Possible reference materials for this topic include

Quick links: Google search, arXiv.org search, propose to present a talk

algebra information theory linear algebra signal processing statistics

Dealing with Missing Data

Data are rarely perfect. Robust data science tools must have ways to deal with missing data. However, this is not always easy. A balance must be struck between performance and convenience.

Possible reference materials for this topic include

Quick links: Google search, arXiv.org search, propose to present a talk

Tech Talks computer science data science efficiency first year friendly statistics

Entropy in Mathematics and Information Theory

Possible reference materials for this topic include

Quick links: Google search, arXiv.org search, propose to present a talk

information theory probability statistics

Fermi Estimation

Physicist Enrico Fermi was known for his impressive order-of-magnitude estimation. He was famously able to estimate the yield of the Trinity test atomic bomb to about a factor of two. Why is Fermi estimation so useful and accurate? What are the techniques to make better approximations? How can and do scientists, mathematicians, engineers, and others use Fermi estimation in their work?

Possible reference materials for this topic include

Quick links: Google search, arXiv.org search, propose to present a talk

approximation physics statistics

Jupyter Notebooks

Jupyter Notebooks are a must-have for any data scientist or engineer. They are available for a wide variety of programming languages, particularly Python.

Possible reference materials for this topic include

Quick links: Google search, arXiv.org search, propose to present a talk

Tech Talks data science first year friendly statistics tutorial

Optimal Stopping Theory and the Secretary Problem

Possible reference materials for this topic include

Quick links: Google search, arXiv.org search, propose to present a talk

decision theory probability statistics