Statistics
There have been 2 completed talks and 7 topic suggestions tagged with statistics.
Related Tags
- information theory
- data science
- probability
- Tech Talks
- first year friendly
- markov chain
- approximation
- tutorial
Completed Talks
Markov Chain Monte Carlo
Delivered by Jacob Jackson on Wednesday October 4, 2017
The talk will introduce Markov chain Monte Carlo methods as a means of sampling from a distribution. The Metropolis-Hastings algorithm will be discussed as well as applications of Markov chain Monte Carlo for Bayesian inference and optimization.
Will expect familiarity with basic probability theory, especially conditional probability.
The slides for this talk are available at Jacob Jackson’s website.
Information Theory
Delivered by Sidhant Saraogi on Friday October 14, 2016
I will try to provide a brief introduction to Information Theory working towards motivating Shannon's Source Coding Theorem. We will use rather simple examples (for e.g. Repetition Codes) to explain the idea of noisy channels and similarly simple examples to explain the idea behind the theorem and eventually try to prove it for a rather specific example. (if we have the time !)
Talk Suggestions
Complex Event Processing Systems
The ever-increasing amount of information that needs to be processed has led to the development of Complex Event Processing systems such as Apache Storm or Twitter Heron. These systems distribute a workload over many machines in a cluster, and offer both efficiency and fault-tolerance.
Possible reference materials for this topic include
Kulkarni, S., Bhagat, N., Fu, M., Kedigehalli, V., Kellogg, C., Mittal, S., ... & Taneja, S. (2015, May). Twitter heron: Stream processing at scale. In Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data (pp. 239-250). ACM. doi:10.1145/2723372.2742788
Quick links: Google search, arXiv.org search, propose to present a talk
Tech Talks computer science data science distributed system first year friendly parallel computing statistics
Compressed Sensing
Compressed sensing is about minimizing the information gathered and stored by sensors, reducing the need for file compression later on for transmission. This can reduce costs for certain applications, such as non-visible wavelength cameras.
Possible reference materials for this topic include
Quick links: Google search, arXiv.org search, propose to present a talk
algebra information theory linear algebra signal processing statistics
Dealing with Missing Data
Data are rarely perfect. Robust data science tools must have ways to deal with missing data. However, this is not always easy. A balance must be struck between performance and convenience.
Possible reference materials for this topic include
Quick links: Google search, arXiv.org search, propose to present a talk
Tech Talks computer science data science efficiency first year friendly statistics
Entropy in Mathematics and Information Theory
Possible reference materials for this topic include
Quick links: Google search, arXiv.org search, propose to present a talk
information theory probability statistics
Fermi Estimation
Physicist Enrico Fermi was known for his impressive order-of-magnitude estimation. He was famously able to estimate the yield of the Trinity test atomic bomb to about a factor of two. Why is Fermi estimation so useful and accurate? What are the techniques to make better approximations? How can and do scientists, mathematicians, engineers, and others use Fermi estimation in their work?
Possible reference materials for this topic include
Quick links: Google search, arXiv.org search, propose to present a talk
approximation physics statistics
Jupyter Notebooks
Jupyter Notebooks are a must-have for any data scientist or engineer. They are available for a wide variety of programming languages, particularly Python.
Possible reference materials for this topic include
Quick links: Google search, arXiv.org search, propose to present a talk
Tech Talks data science first year friendly statistics tutorial
Optimal Stopping Theory and the Secretary Problem
Possible reference materials for this topic include
Quick links: Google search, arXiv.org search, propose to present a talk