Figure 1: Overview of the Nike Shoe Sales Analysis Project.
Nike is one of the largest and most recognized athletic shoe brands globally, renowned for its high-quality products and innovative designs. To understand consumer acceptance of Nike shoes, sales data from various online platforms (Nike official website, Amazon, and other retailers) was collected. This data includes product information, pricing, discounts, ratings, reviews, and images. The analysis was conducted using the CRISP-DM method and SAS data analysis tools to identify consumer trends and preference patterns. The results of this analysis are expected to provide useful insights for designing more effective product and marketing strategies, enhancing customer satisfaction, and optimizing Nike's sales.
Understanding how Nike shoes are received by consumers is crucial for strategic decision-making. The challenge lies in extracting meaningful insights from a diverse dataset that includes product information, pricing, discounts, ratings, reviews, and images from various online platforms. Traditional analysis methods may not fully uncover the complex patterns and predictive behaviors needed for effective marketing and product development. In the era of data-driven decision-making, machine learning becomes an invaluable tool to discover intricate patterns and predict consumer behavior, especially in complex sales contexts. The goal is to identify the best algorithms to recognize patterns that reflect consumer behavior through historical data analysis.
This research aims to explore the application of various machine learning algorithms, including Decision Tree, Support Vector Machine (SVM), and Logistic Regression (as primary focus), as well as Random Forest, Gradient Boosting, and Neural Networks, in the analysis of Nike shoe sales. By applying the CRISP-DM methodology, our study aims to conduct a comprehensive comparative analysis to understand the factors influencing consumer preferences. The objective is to find the best algorithm for recognizing patterns that reflect consumer behavior through historical data analysis, ultimately helping to design better product and marketing strategies for Nike and other shoe brands, and assisting consumers in making more informed purchasing decisions.
As an author and researcher for the "Nike Shoe Sales Analysis" project (for the UAS IS429 BDA Even Theory 2023-2024 course) in the Department of Informatics, Universitas Multimedia Nusantara, my responsibilities included the comprehensive analysis of Nike sales data. My role involved the application of the CRISP-DM methodology, detailed data understanding and preparation, selection and implementation of multiple machine learning algorithms (Random Forest, Decision Tree, Gradient Boosting, Neural Network, Linear Regression), evaluation of model performance using SAS Visual Analytics, and the interpretation of results to uncover consumer trends. I contributed to all phases of the data mining process, from initial business understanding to model deployment and the final report.
Figure 2: Data Understanding and Preparation Phase in SAS Visual Analytics.
In this research, we applied the CRISP-DM (Cross-Industry Standard Process for Data Mining) methodology, which includes Business Understanding, Data Understanding, Data Preparation, Modeling, Evaluation, and Deployment. This framework provided a structured guide for our data mining project.
Figure 4: CRISP-DM Flowchart for Data Mining Process.
We leveraged SAS Visual Analytics to build and evaluate predictive models for Nike shoe ratings. Each algorithm provided unique insights into variable importance and prediction accuracy:
Figure 5: Various Machine Learning Models and their Visualizations in SAS.
The implementation of various predictive models in Nike shoe sales analysis has provided valuable insights into consumer trends and product rating predictions. Each model, from Random Forest to Neural Network and Linear Regression, contributed significantly to understanding factors influencing Nike shoe ratings, highlighting different data aspects and offering unique insights. The Neural Network and Linear Regression models stood out as the best performers, with Neural Network excelling in complex variable relationships and Linear Regression providing interpretable linear relationships with high accuracy. These insights are invaluable for business decision-makers to understand consumer behavior, optimize marketing strategies, develop products aligned with consumer preferences, and enhance customer satisfaction. The results are crucial for making data-driven decisions to enhance overall business profitability and success.
The use of various predictive models in Nike shoe sales analysis provides deep insights into consumer trends and product rating predictions. While Neural Network and Linear Regression emerged as the best models, the strengths of each model can be utilized to support better business decisions. For future development, key recommendations include enhancing data quality through better cleaning and additional collection, exploring advanced machine learning techniques like Deep Learning for higher data complexity, and integrating these models into existing management systems for real-time operational decision-making. Furthermore, investing in internal staff training for effective model operation and continuously monitoring model performance are crucial to ensure reliability and responsiveness to market changes. By applying these recommendations, the predictive models in Nike shoe sales analysis can become even more effective and generate a more positive impact for the company.