Search Content

Predictive Modeling of 4th Down Selection in Power 5 Conference: Data Analytics

Description

Predictive analytics have been used in a wide variety of settings, including healthcare,
sports, banking, and other disciplines. We use predictive analytics and modeling to
determine the impact of certain factors that increase the probability of a successful
fourth down conversion in the Power 5 conferences. The logistic regression models…

Predictive analytics have been used in a wide variety of settings, including healthcare,
sports, banking, and other disciplines. We use predictive analytics and modeling to
determine the impact of certain factors that increase the probability of a successful
fourth down conversion in the Power 5 conferences. The logistic regression models
predict the likelihood of going for fourth down with a 64% or more probability based on
2015-17 data obtained from ESPN’s college football API. Offense type though important
but non-measurable was incorporated as a random effect. We found that distance to go,
play type, field position, and week of the season were key leading covariates in
predictability. On average, our model performed as much as 14% better than coaches
in 2018.

ContributorsBlinkoff, Joshua Ian (Co-author) / Voeller, Michael (Co-author) / Wilson, Jeffrey (Thesis director) / Graham, Scottie (Committee member) / Dean, W.P. Carey School of Business (Contributor) / Department of Information Systems (Contributor) / Department of Management and Entrepreneurship (Contributor) / Barrett, The Honors College (Contributor)

Created2019-05

Predictive Modeling of 4th Down Selection in Power 5 Conference: Data Analytics

Description

Predictive analytics have been used in a wide variety of settings, including healthcare, sports, banking, and other disciplines. We use predictive analytics and modeling to determine the impact of certain factors that increase the probability of a successful fourth down conversion in the Power 5 conferences. The logistic regression models…

Predictive analytics have been used in a wide variety of settings, including healthcare, sports, banking, and other disciplines. We use predictive analytics and modeling to determine the impact of certain factors that increase the probability of a successful fourth down conversion in the Power 5 conferences. The logistic regression models predict the likelihood of going for fourth down with a 64% or more probability based on 2015-17 data obtained from ESPN’s college football API. Offense type though important but non-measurable was incorporated as a random effect. We found that distance to go, play type, field position, and week of the season were key leading covariates in predictability. On average, our model performed as much as 14% better than coaches in 2018.

ContributorsVoeller, Michael Jeffrey (Co-author) / Blinkoff, Josh (Co-author) / Wilson, Jeffrey (Thesis director) / Graham, Scottie (Committee member) / Department of Information Systems (Contributor) / Department of Finance (Contributor) / Barrett, The Honors College (Contributor)

Created2019-05

Rethinking the Job Search: Bringing Technology Undergraduates a Better Understanding of their Job Market

Description

The objective of this project was the creation of a web app for undergraduate CIS/BDA students which allows them to search for jobs based on criteria that are not always directly available with the average job search engine. This includes technical skills, soft skills, location and industry. This…

The objective of this project was the creation of a web app for undergraduate CIS/BDA students which allows them to search for jobs based on criteria that are not always directly available with the average job search engine. This includes technical skills, soft skills, location and industry. This creates a more focused way for these students to search for jobs using an application that also attempts to exclude positions that are looking for very experienced employees. The activities used for this project were chosen in attempt to make as many of the processes as automatable as possible.
This was achieved by first using offline explorer, an application that can download websites, to gather job postings from Dice.com that were searched by a pre-defined list of technical skills. Next came the parsing of the downloaded postings to extract and clean the data that was required and filling a database with that cleaned data. Then the companies were matched up with their corresponding industries. This was done using their NAICS (North American Industry Classification System) codes. The descriptions were then analyzed, and a group of soft skills was chosen based on the results of Word2Vec (a group of models that assists in creating word embeddings). A master table was then created by combining all of the tables in the database. The master table was then filtered down to exclude posts that required too much experience. Lastly, the web app was created using node.js as the back-end. This web app allows the user to choose their desired criteria and navigate through the postings that meet their criteria.

ContributorsHenry, Alfred (Author) / Darcy, David (Thesis director) / Moser, Kathleen (Committee member) / Department of Information Systems (Contributor, Contributor) / Barrett, The Honors College (Contributor)

Created2019-05

Investigating the Relationship between Neighborhood Socioeconomic Status and Proximity to Public Services

Description

With growing levels of income inequality in the United States, it remains as important as ever to ensure indispensable public services are readily available to all members of society. This paper investigates four forms of public services (schools, libraries, fire stations, and police stations), first by researching the background of…

With growing levels of income inequality in the United States, it remains as important as ever to ensure indispensable public services are readily available to all members of society. This paper investigates four forms of public services (schools, libraries, fire stations, and police stations), first by researching the background of these services and their relation to poverty, and then by conducting geospatial and regression analysis. The author uses Esri's ArcGIS Pro software to quantify the proximity to public services from urban American neighborhoods (census tracts in the cities of Phoenix and Chicago). Afterwards, the measures indicating proximity are compared to the socioeconomic statuses of neighborhoods using regression analysis. The results indicate that pure proximity to these four services is not necessarily correlated to socioeconomic status. While the paper does uncover some correlations, such as a relationship between school quality and socioeconomic status, the majority of the findings negate the author's hypothesis and show that, in Phoenix and Chicago, there is not much discrepancy between neighborhoods and the extent to which they are able to access vital government-funded services.

ContributorsNorbury, Adam Charles (Author) / Simon, Alan (Thesis director) / Simon, Phil (Committee member) / Department of Information Systems (Contributor) / Department of English (Contributor) / Department of Economics (Contributor) / Barrett, The Honors College (Contributor)

Created2018-05

Data Analytics to Identify the Genetic Basis for Resilience to Temperature Stress in Soybeans

Description

This paper explores the ability to predict yields of soybeans based on genetics and environmental factors. Based on the biology of soybeans, it has been shown that yields are best when soybeans grow within a certain temperature range. The event a soybean is exposed to temperature outside their accepted range…

This paper explores the ability to predict yields of soybeans based on genetics and environmental factors. Based on the biology of soybeans, it has been shown that yields are best when soybeans grow within a certain temperature range. The event a soybean is exposed to temperature outside their accepted range is labeled as an instance of stress. Currently, there are few models that use genetic information to predict how crops may respond to stress. Using data provided by an agricultural business, a model was developed that can categorically label soybean varieties by their yield response to stress using genetic data. The model clusters varieties based on their yield production in response to stress. The clustering criteria is based on variance distribution and correlation. A logistic regression is then fitted to identify significant gene markers in varieties with minimal yield variance. Such characteristics provide a probabilistic outlook of how certain varieties will perform when planted in different regions. Given changing global climate conditions, this model demonstrates the potential of using data to efficiently develop and grow crops adjusted to climate changes.

ContributorsDean, Arlen (Co-author) / Ozcan, Ozkan (Co-author) / Travis, Daniel (Co-author) / Gel, Esma (Thesis director) / Armbruster, Dieter (Committee member) / Parry, Sam (Committee member) / Industrial, Systems and Operations Engineering Program (Contributor) / Department of Information Systems (Contributor) / Barrett, The Honors College (Contributor)

Created2018-05

Attendance Elasticity of Win Percentage in the NBA: An Exploration of the Effects of Team Performance on Home Game Attendance

Description

In the wide world of sports, not all fan bases are created equally—especially in the NBA. Differences in factors like tradition, history, team performance amongst teams make each fan base distinctly unique. This paper will analyze how team performance effects one component of fan behavior: home game attendance. Using win-loss…

In the wide world of sports, not all fan bases are created equally—especially in the NBA. Differences in factors like tradition, history, team performance amongst teams make each fan base distinctly unique. This paper will analyze how team performance effects one component of fan behavior: home game attendance. Using win-loss data and home game attendance data for each NBA team from 2001 to 2017, I will construct statistical models to estimate how great of an impact team performance has on each team’s home game attendance. I expect each team’s fan base to respond differently to changes in their team’s win-loss record. This paper will also attempt to quantify other facts that impact attendance at NBA games, including year-to-year changes in team salary expenditures, regional income, and the number of star players playing for the team. Finally, this paper will explore the factors that affect home game attendance for specific games within a given season—things like weather, strength of opponent, and win streaks. Ultimately, the goal of this paper will be to provide NBA business analysts with resources to more precisely anticipate their team’s home game attendance. The ability to understand what motivates the behavior of a fan base is invaluable in creating a marketing strategy that drives fans to the arena. This paper will help to identify teams that are most susceptible to significant fluctuations in attendance and outline alternative strategies to positioning their product offering effectively to fans.

ContributorsSloan, Jacob Marlow (Author) / Lee, Christopher (Thesis director) / Eaton, John (Committee member) / Department of Marketing (Contributor) / Department of Information Systems (Contributor) / Barrett, The Honors College (Contributor)

Created2018-05

Impulse Spending with Data

Description

Data is ever present in the world today. Data can help predict presidential elections, Super Bowl champions, and even the weather. However, it's very hard, if not impossible, to predict how people feel unless they tell us. This is when impulse spending with data comes in handy. Companies are constantly…

Data is ever present in the world today. Data can help predict presidential elections, Super Bowl champions, and even the weather. However, it's very hard, if not impossible, to predict how people feel unless they tell us. This is when impulse spending with data comes in handy. Companies are constantly looking for ways to get honest feedback when they are doing market research. Often, the research obtained ends up being unreliable or biased in some way. Allowing users to make impulse purchases with survey data is the answer. Companies can still gather the data that they need to do market research and customers can get more features or lives for their favorite games. It becomes a win-win for both users and companies. By adding the option to pay with information instead of money, companies can still get value out of frugal players. Established companies might not care so much about the impulse spending for purchases made in the application, however they would find a great deal of value in hearing about what customers think of their product or upcoming event. The real value from getting data from customers is the ability to train analytics models so that companies can make better predictions about consumer behavior. More accurate predictions can lead to companies being better prepared to meet the needs to the customer. Impulse spending with data provides the foundation to creating a software that can create value from all types of users regardless of whether the user is willing to spend money in the application.

ContributorsYotter, Alexandria Lee (Author) / Olsen, Christopher (Thesis director) / Sopha, Matthew (Committee member) / Department of Information Systems (Contributor) / Barrett, The Honors College (Contributor)

Created2016-12

Twitter Analytics

Description

Twitter is one of the most powerful communication tools ever created. There are over 1.3 billion registered Twitter users (Smith, 2016). 100 million daily people actively use Twitter every day. 6,000 tweets are tweeted every second. Communication has never been so abundant, public, and chronicled. Not only is there a…

Twitter is one of the most powerful communication tools ever created. There are over 1.3 billion registered Twitter users (Smith, 2016). 100 million daily people actively use Twitter every day. 6,000 tweets are tweeted every second. Communication has never been so abundant, public, and chronicled. Not only is there a gigantic population to market to, but also a wealth of information about that population to record and draw insights from. However, many companies' Twitter accounts fail to generate popular posts on a regular basis. The content that they produce is ineffective and uninteresting. In my opinion, these companies are failing to take advantage of a huge opportunity. I decided to dive into the Twitter accounts of some of my favorite companies to see what they were doing wrong and how they could improve. My thesis investigates 18 different company Twitter accounts from four different industries: Athletic Apparel, Technology, Online Entertainment, and Car Manufacturing. I pulled 200 tweets from each company and cleaned and organized the data into an Excel spreadsheet. I investigated how certain variables impacted tweet popularity across the four industries. First, I looked at tweet format to determine whether posts, retweets, or replies were the best format. Then, I analyzed how different elements of a tweet's content could impact the tweet's popularity. Specifically, I looked at the effects of including links, hashtags, and questions into the tweet. Next, I tried to determine the optimal tweet length for each industry. And finally, I compared each industry's tweet sentiment preferences. I then summarized my findings into a series of recommendations for companies to improve their tweet popularity.

ContributorsFrame, Christopher James (Author) / Clark, Joseph (Thesis director) / Jenkins, Anthony (Committee member) / Department of Information Systems (Contributor) / Department of Finance (Contributor) / Barrett, The Honors College (Contributor)

Created2016-05

Energy Internet of Things Collaborative Thesis

Description

For our collaborative thesis we explored the US electric utility market and how the Internet of Things technology movement could capture a possible advancement of the current existing grid. Our objective of this project was to successfully understand the market trends in the utility space and identify where a semiconductor…

For our collaborative thesis we explored the US electric utility market and how the Internet of Things technology movement could capture a possible advancement of the current existing grid. Our objective of this project was to successfully understand the market trends in the utility space and identify where a semiconductor manufacturing company, with a focus on IoT technology, could penetrate the market using their products. The methodology used for our research was to conduct industry interviews to formulate common trends in the utility and industrial hardware manufacturer industries. From there, we composed various strategies that The Company should explore. These strategies were backed up using qualitative reasoning and forecasted discounted cash flow and net present value analysis. We confirmed that The Company should use specific silicon microprocessors and microcontrollers that pertained to each of the four devices analytics demand. Along with a silicon strategy, our group believes that there is a strong argument for a data analytics software package by forming strategic partnerships in this space.

ContributorsLlazani, Loris (Co-author) / Ruland, Matthew (Co-author) / Medl, Jordan (Co-author) / Crowe, David (Co-author) / Simonson, Mark (Thesis director) / Hertzel, Mike (Committee member) / Department of Economics (Contributor) / Department of Finance (Contributor) / Department of Supply Chain Management (Contributor) / Department of Information Systems (Contributor) / Hugh Downs School of Human Communication (Contributor) / Barrett, The Honors College (Contributor)

Created2016-05

Sagebrush Coffee: Applying Data Analytics to Customer Purchases

Description

Sagebrush Coffee is a small business in Chandler, Arizona that purchases green beans, roasts them in small batches for quality, and ships fresh, gourmet roasted coffee beans across the nation. Deciding which coffee beans to buy and roast is one of the most crucial business decisions Sagebrush and other gourmet…

Sagebrush Coffee is a small business in Chandler, Arizona that purchases green beans, roasts them in small batches for quality, and ships fresh, gourmet roasted coffee beans across the nation. Deciding which coffee beans to buy and roast is one of the most crucial business decisions Sagebrush and other gourmet coffee roasters face. Further complicating this decision is the fact that coffee is a crop, and like all crops, has a specific growing season and the exact same product cannot usually be ordered from year to year, even if it proves to be successful. The goal of this research is to use data analytics and visualization to help Sagebrush make better purchasing decisions by identifying consumer purchasing trends and providing a recommendation for their portfolio mix. In the end, I found that Latin American coffees are popular with both returning and first-time customers, but a specific country of origin does not appear to be associated with the top coffee producing countries. Additionally, December is a critical month for Sagebrush and Sagebrush should make sure to target the states with the most sales: California, Pennsylvania, and New York. Arizona has growth potential as it is not one of the top three locations, despite the presence of a physical store. Also included in the following report is a portfolio recommendation suggesting how many of each product based on region, processing type, and roast level to carry in inventory.

ContributorsBlue, Jessica Morgan (Author) / Kellso, James (Thesis director) / Davila, Eddie (Committee member) / Department of Information Systems (Contributor) / Economics Program in CLAS (Contributor) / Department of Supply Chain Management (Contributor) / Morrison School of Agribusiness (Contributor) / Barrett, The Honors College (Contributor)

Created2018-05

Filtering by