gravatar

Tanner26

Tanner Norton

Recently Published

US Train Fatalities 2000 - 2018
All data used in this project was gathered from the Federal Railroad Administrations website. The question I sought to know is, "In what state do most train fatalities occur in and what characteristics describe those accidents?".
US Train Deaths 2018
I downloaded from [Federal Railroad Administration Office of Safety Analysis]("https://safetydata.fra.dot.gov/OfficeofSafety/publicsite/on_the_fly_download.aspx") website. The data contains all railroad accidents that involved casualties from 2018 in the United States.
Day Care Fines: ANOVA
A famous study performed in Israel in 1998 which looked day care centers and the tardiness of parents. This analysis uses an ANOVA test to determine the effectiveness of issuing a fine to late parents.
College Baseball - Linear Regression
Several factors go into a baseball team winning, an old phrase “Pitching wins championships” singles out pitching as most influential of them all. There are tons of pitching statistics but for this analysis, we are going to focus on “Earned Run Average” or ERA. The data collected is the 2019 team ERA from all 297 division 1 college teams. The question to answer is do teams with lower ERA’s have higher win percentages?
Cadillac Deville - Linear Regression
The purpose of this analysis is to create a model that provides the best fit possible for the data that comes from the Cadillac Deville. I created a new model to represent the Deville using a new explanatory variable which is Trim. There are three different Trims to the Deville which are the DHS (most luxiourious), DTS (best performance), and the Sedan (most basic).
High School Seniors: T-test
Data collected via survey of High school seniors across the nation. The purpose of this experiment is to determine if students who prefer water to caffeinated drinks get significantly different amounts of sleep on school nights. A T-test was set up to appropriately test the question.
Loan Default - Logistic Regression
The purpose of this analysis will be to use simple linear regression on a dataset containing information about past loan recipients and therefore be able to predict an individuals odds of defaulting on their Home equity loan. In this case the debt to income ratio was used as the explanatory variable.
Marital Status & Income: Chi Squared Analysis
The Data used in this analysis was extracted by Barry Becker from the 1994 Census Database. The purpose of the analysis will be to find out “if an individuals Income bracket is independent of their Marital status or related?”. A Chi Squared test was used to perform the analysis.
Candy - Simple Logistic Regression
The purpose of this analysis is to predict based off the candies winning percentage if the candy has chocolate. The data comes from Walt Hickey and can be found with the following link, https://fivethirtyeight.com/features/the-ultimate-halloween-candy-power-ranking/ . The analysis will also allow us to understand if peoples favorite candies are chocolate based or not. Simple Logistic Regression will be used with the independent binary variable indicating if the candy has chocolate or not.
Recalling Words Analysis
The purpose of this analysis is to determine whether or not the Meshed or Before methods have any positive benefit on memory recall. The dataset used is named Friendly and is in car library found in R. A Wilcoxon rank sum test was performed.
Document
This is an analysis using a fictional character Stephanie who is seeking BYUI approved student housing in Rexburg, ID. She has certain requirements for the location, size, and pricing of the apartment complex. Real data was used in order to help her narrow down the best options.