Homework 4
Note that most problems have a ".ext" file extensions for the deliverable.
This implies that you can use R or Python to solve the problem.
For this assignment each problem that you submit code for, 3 points will be awarded for completing the project entirely on the server.
This will give you practice dealing with coding and testing in that environment.
For each of the problems, add a comment at the top stating if it was done entirely on the server or if Rstudio or any other IDE was used. There is no plotting or pdf's involved, so that should make it easier.
- Write a script that solves the mystery of the Polya urn: Polya.pdf. The Polya urn is actually a concept popular for some Bayesian non-parametric models. I had some fun making an epic scenario for this though.
-
Is LeBron Slowing Down?
It's no secret that age can impact athletic performance. This task aims to see if the great
LeBron James, one of the best players in NBA history, is showing signs of slowing down as he ages.
Using a permutation/randomization test, you'll compare LeBron's points per game (PPG) during his
prime season (2013-2014) to his most recent season (2023-2024). Are the differences significant?
-
Download the data from lebron_points_per_game.csv.
- Test the null hypothesis that there is no difference in PPG between the two seasons.
Use a randomization test to determine whether LeBron's scoring has significantly changed.
- Submit your code and results in a file named "lebron_test.ext".
- Using Monte Carlo integration, find the integral of the function f(x) =
1-2/pi*(1-x)^2*exp(x)/x*asin(sqrt(x)) for x in [0,1]. Be more creative that
simply sampling from the uniform distribution. As always, assess your Monte
Carlo error. Submit your solution in a file named "integral.ext"
- A mark-recapture design is a common method used in ecology to determine the size of a population when it is impractical to count every individual of a population. The idea is to mark a certain number of species and then release them. Later you will again observe a certain number and count how many are marked. Suppose a pond has an unknown number of fish where each fish has an equal chance of being caught. You capture, mark, and release 100 of the fish. You then recapture 100 fish and observe 16 are marked. Use a simulation study to test the null hypothesis that there are 1000 fish in the pond against the alternative that there is not 1000 fish in the pond. Submit your work in a file called fish.ext
- Submit this homework by committing the necessary files (including any
data files) in the appropriate directory and pushing to your central Git
repository. Remember to not commit files that are easily reproducible.