RPubs

by RStudio

Recently Published

Primary School Statistics (Estadística de Primaria)

By JJTilano

This is a unified Statistics curriculum for primary school.

7 months ago

One-Sample Proportion Test (Binomial Test)

By mankyconguyet

This study presents an application of the Binomial Test, an exact statistical method, to test hypotheses about a single population proportion. Using R/RStudio, the article analyzes a real-world dataset on the social media habits of school and university students. The goal is to draw statistically significant conclusions about issues such as the rate of social media addiction and its negative impact on academic performance.
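As a rough illustration of what an exact binomial test computes, here is a minimal Python sketch. It is not the author's code (the study uses R/RStudio), and the counts below (60 "yes" answers out of 100 respondents, null proportion 0.5) are hypothetical, not taken from the study's dataset. The function name `exact_binomial_test` is an illustrative choice. The two-sided p-value sums the probabilities of all outcomes that are no more likely than the observed one under the null hypothesis.

```python
from math import comb


def binom_pmf(k, n, p):
    """Probability of exactly k successes in n trials with success rate p."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def exact_binomial_test(successes, n, p0):
    """Two-sided exact p-value for H0: population proportion == p0.

    Sums the probability of every outcome that is at most as likely as
    the observed count under H0. The small relative tolerance guards
    against floating-point ties.
    """
    observed = binom_pmf(successes, n, p0)
    threshold = observed * (1 + 1e-7)
    total = 0.0
    for k in range(n + 1):
        prob = binom_pmf(k, n, p0)
        if prob <= threshold:
            total += prob
    return min(1.0, total)


# Hypothetical data: 60 of 100 respondents report the behavior of interest.
p_value = exact_binomial_test(successes=60, n=100, p0=0.5)
print(f"two-sided p-value: {p_value:.4f}")
```

A small p-value would suggest the population proportion differs from `p0`; the cutoff for significance (for example 0.05) is a separate choice made by the analyst.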

7 months ago

H517 Research Paper Summary

By mfrontz

My summary of a 2019 research paper on how visualization design can affect how users perceive causality in correlated variables.

7 months ago

Energy Usage Analysis System

By ssmith

This analysis uses the Energy Usage Analysis System dataset, which tracks energy consumption across government facilities. It focuses on electricity, natural gas, fuel oil, chilled water, and steam usage from 1973 to 2019, as published by the U.S. General Services Administration.

7 months ago

Homework 5 - October 15, 2025

By michellevigil

7 months ago

Tidying Wide Datasets to produce Long Datasets

By Candace63

Tidying wide datasets involves transforming data from a format where multiple measurements are spread across separate columns into a long format where each row represents a single observation. In wide format, each subject or entity occupies one row, with many columns representing different variables or time periods, which can make filtering, grouping, and visualization challenging. The transformation uses functions like `pivot_longer()` in R or `melt()` in pandas (Python) to collapse multiple measurement columns into two key columns: one identifying the type of measurement and another containing the actual value. This restructuring follows tidy data principles, where each variable forms a column, each observation forms a row, and each type of observational unit forms a table, making the data more suitable for statistical analysis and machine learning algorithms. The result is a dataset with more rows but fewer columns that is easier to filter by measurement type, to visualize, and to analyze with modern data science tools.
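The wide-to-long reshape described above can be sketched in plain Python without any dependencies. This is only an illustration of the mechanics, not the author's code; in practice `pivot_longer()` or `melt()` would do this work. The helper name `wide_to_long`, the column names, and the sample scores are all hypothetical.

```python
def wide_to_long(rows, id_cols, var_name="measurement", value_name="value"):
    """Collapse every non-identifier column into (var_name, value_name) pairs.

    rows    : list of dicts, one dict per subject (wide format)
    id_cols : columns that identify the subject and are repeated on each long row
    """
    long_rows = []
    for row in rows:
        ids = {col: row[col] for col in id_cols}
        for col, val in row.items():
            if col in id_cols:
                continue
            # One output row per (subject, measurement) pair.
            long_rows.append({**ids, var_name: col, value_name: val})
    return long_rows


# Hypothetical wide data: one row per student, one column per subject score.
wide = [
    {"student": "A", "math": 90, "reading": 85},
    {"student": "B", "math": 72, "reading": 88},
]

long = wide_to_long(wide, id_cols=["student"])
for record in long:
    print(record)

# Filtering by measurement type is now a simple row filter.
reading_scores = [r["value"] for r in long if r["measurement"] == "reading"]
```

Two wide rows with two measurement columns each become four long rows, which is the "more rows, fewer columns" trade-off the paragraph describes.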

7 months ago

Animated Suppression Gap (ΔX) Map: Revealing Hidden Suicide Burden Across Reporting Ranges

By eswepston

This interactive map visualizes how suicide data visibility changes as reporting ranges expand. Each frame shows the ΔX (percentage-point drop in data suppression) between consecutive range lengths. Red regions represent states where suicide data become visible only with multi-year aggregation, highlighting areas of chronic, low-frequency suicide burden that remain masked in annual reporting. This set includes only the beta test range data and does not include rolling range data.

7 months ago

Bryce Nations

Names of 2024

HW-3

Nations homework

Practical Assignment
