RPubs

by RStudio

mikesilva

Michael Silva

Recently Published

DATA 624 Midterm Project 1C - Water Flow

over 5 years ago

DATA 624 Midterm Project 1B - Power

over 5 years ago

DATA 624 Midterm Project 1A - ATM

over 5 years ago

DATA 621 Week #2 Textbook Exercises

almost 6 years ago

DATA 621 Week #1 Textbook Exercises

almost 6 years ago

DATA 608 Exploratory Data Analysis

Exploratory data visualization with R

almost 6 years ago

Total Arcade Revenue and the Number of PhD Computer Science Graduates

A tong in cheek regression analysis of an interesting spurious correlation.

about 6 years ago

This study set out to see if the size of a home and the era in which it was built could be used to get an estimate of the sale price. While these variables seem to be relevant, predictions made only using the variables are largely inaccurate. It appears that the relationship is more complicated

over 6 years ago

Multiple and Logistic Regression

over 6 years ago

CUNY DATA 606 - Lab 8

over 6 years ago

Intro to Linear Regression

over 6 years ago

CUNY DATA 606 - Lab 7

over 6 years ago

CUNY SPS DATA-607 Week 12 Assignment

MySQL to NoSQL (MongoDB) Migration Exercise

over 6 years ago

Document Classifier Project

Is the email message spam or ham? This SVM can help

over 6 years ago

CUNY DATA 606 Inference for Categorical Data

over 6 years ago

CUNY DATA 606 Lab 6

over 6 years ago

Inference for Numerical Data

over 6 years ago

CUNY SPS DATA 606 Lab 5

over 6 years ago

CUNY DATA 607 - Week 9 Assignment

The assignment is to choose one of the New York Times APIs, construct an interface in R to read in the JSON data, and transform it to an R dataframe.

over 6 years ago

Skills of a Data Scientist

Our motivation for this study is to gain an understanding of which skills are the most useful for a data scientist to have so that we can plan what courses to take in our Master’s program. We scraped 453 job postings from Dice.com. The top skills include: Python, Machine Learning, Data Analysis, SQL and R.

over 6 years ago

CUNY SPS DATA 606 Foundations for Inference

over 6 years ago

CUNY SPS DATA 606 Lab 4b

over 6 years ago

CUNY SPS DATA 606 Lab 4a

Foundations for statistical inference - Sampling distributions

over 6 years ago

DATA 607 - Week 7 Assignment

In this assignment I have selected three books that I enjoyed reading and have entered information these three books in separately created files. One encodes the information in HTML, XML and JSON formats. I am to pull in the data into R and answer if the three data frames are identical.

over 6 years ago

DATA 606 Data Project Proposal

over 6 years ago

DATA 607 Project 2

Creating 3 tidy data sets

almost 7 years ago

CUNY SPS DATA 606 Distributions of Random Variables

Graded problems for the OpenIntro Statistics chapter on Distributions of Random Variables. My answers to the questions are in bold.

almost 7 years ago

CUNY SPS DATA 607 Week 5 Assignment

Analysis of two airlines arrival ratings for five airports

almost 7 years ago

CUNY SPS DATA 606 Lab 3

almost 7 years ago

CUNY SPS DATA 606 Probability

Graded problems for the Intro to Data chapter of Probability. My answers to the questions are in bold.

almost 7 years ago

CUNY SPS DATA-606 Lab 2

almost 7 years ago

CUNY SPS DATA-607 Week 3 Assignment

A little fun with regular expressions.

almost 7 years ago

CUNY SPS DATA-607 Project 1

This project extracts player information from a semi-structured text file.

almost 7 years ago

Hands on Lab 2-3: Combining Data from Multiple Sources

Your task is to create an R dataframe that shows rates of tuberculosis infection by country. You should use the information from the tb database and the attached population.csv file. Your R dataframe should have the following columns: Country, Year, Rate (Where Rate is defined as Cases/Population).

almost 7 years ago

CUNY SPS DATA-607 Week 2 Assignment

Choose six recent popular movies. Ask at least five people that you know (friends, family, classmates, imaginary friends) to rate each of these movie that they have seen on a scale of 1 to 5. Take the results (observations) and store them in a SQL database. Load the information into an R dataframe. The films used in this study were selected by Googling “2018 recent films” and getting the first 6 films listed in the search results.

almost 7 years ago

CUNY SPS DATA-606 Lab 1

Introduction to Data

almost 7 years ago

CUNY SPS DATA 606 Intro to Data

Graded problems for the Intro to Data chapter of Open Intro to Statistics. My answers to the questions are bolded.

almost 7 years ago

CUNY SPS DATA-606 Lab 0

Introduction to R and RStudio

almost 7 years ago

CUNY SPS DATA-607 Week 1 Assignment

The assignment is to load the Mushroom Dataset from the UCI repository into a data frame, rename the columns into something meaningful, replace the abbreviations used in the data, and subset the columns of the data frame. It must include the column that indicated if the mushroom is edible or poisonous and three or four other columns.

almost 7 years ago

CUNY SPSS Data Science Math Homework 3

almost 7 years ago

CUNY SPS Data Science Math Homework 2

almost 7 years ago

CUNY SPS Data Science Math Homework 1

almost 7 years ago

Metro Employment Index

This document explains how the metro employment index is derived. This index is base on the Local Area Unemployment Statistics (LAUS) published by the U.S. Bureau of Labor Statistics (BLS). It is seasonally adjusted using the Loess seasonal decomposition method.

over 10 years ago

The Economic and Health Consequences of Storms

This analysis was preformed as part of Coursera's Reproducible Research course.

almost 11 years ago

Sign In

mikesilva

Michael Silva

Recently Published