Abstract
There has been tremendous growth for the need of analytics and BI tools in every organization, in every sector such as finance, software, medicine and even astronomy in order to better overall performance. C-factor Computing has the same vision of empowering their existing products through data analysis and forecasting to better suit the need of customers and decision making of stakeholders. The project involves 5 key aspects in Analytics - Data Acquisition, Big data or data Storage, Data Transformation (Unstructured to Structured), Data Wrangling, Predictive Modeling / Visualization. Data Acquisition involves gathering existing transactional and search data of customers and travel aggregators who use the product. This data is used to create powerful dashboards capable of predictive analytics which help the company make informed choices. The key aspects mentioned can be achieved through various tools available but requires testing at every stage in order to realize the appropriate software for the data present in the company. Hence the project deals with studying and implementing selected tools in order to provide the right framework to achieve an interactive dashboard capable of predictive analytics which can also be integrated into the existing products of the company.
References
- Ariyo, A. O. Adewumi and C. K. Ayo (2014) ‘Stock Price Prediction Using the ARIMA Model’, UKSim-AMSS 16th International Conference on Computer Modelling and Simulation, Cambridge, pp. 106-112, doi: 10.1109/UKSim.2014.67.
- Singh, N. Thakur and A. Sharma (2016) ‘A review of supervised machine learning algorithms,’ 3rd International Conference on Computing for Sustainable Global Development (INDIACom), New Delhi, pp. 1310-1315.
- V. Patil and R. S. Bichkar, (2006) ‘A Hybrid Evolutionary Approach To Construct Optimal Decision Trees With Large Data Sets,’ IEEE International Conference on Industrial Technology, Mumbai, pp. 429-433, doi: 10.1109/ICIT.2006.372250.
- F. F. Lubis, Y. Rosmansyah and S. H. Supangkat, (2014) ‘Gradient descent and normal equations on cost function minimization for online predictive using linear regression with multiple variables,’ International Conference on ICT For Smart Society (ICISS), Bandung, pp. 202-205, doi: 10.1109/ICTSS.2014.7013173.
- F. Harrou, M. Nounou and H. Nounou, (2013) ‘A statistical fault detection strategy using PCA based EWMA control schemes,’ 9th Asian Control Conference (ASCC), Istanbul, pp. 1-4, doi: 10.1109/ASCC.2013.6606311.
- G. Kesavaraj and S. Sukumaran, (2013) ‘A study on classification techniques in data mining,’ Fourth International Conference on Computing, Communications and Networking Technologies (ICCCNT), Tiruchengode, pp. 1-7. DOI: 10.1109/ICCCNT.2013.6726842
- H. Lan and Y. Pan, (2019) ‘A Crowdsourcing Quality Prediction Model Based on Random Forests,’ IEEE/ACIS 18th International Conference on Computer and Information Science (ICIS), Beijing, China, pp. 315-319, doi: 10.1109/ICIS46139.2019.8940306.
- H. Xie and F. Shang, (2014) ‘The study of methods for post-pruning decision trees based on comprehensive evaluation standard,’ 11th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD), Xiamen, pp. 903-908, doi: 10.1109/FSKD.2014.6980959.
- H. Yi, Q. Xiong, Q. Zou, R. Xu, K. Wang and M. Gao, (2019) ‘A Novel Random Forest and its Application on Classification of Air Quality,’ 8th International Congress on Advanced Applied Informatics (IIAI-AAI), Toyama, Japan, pp. 35-38, doi: 10.1109/IIAI-AAI.2019.00018.
- Jing Chai, Hongwei Liu and Zheng Bao,(2009) ‘A W-KNN classifier to improve radar outlier rejection performance,’ IET International Radar Conference, Guilin, pp. 1-4, doi: 10.1049/cp.2009.0106.
- Kumar, P. Satheesh, P. Chitra, and S. Sneha. "Design of Improved Quadruple-Mode Bandpass Filter Using Cavity Resonator for 5G Mid-Band Applications." Future Trends in 5G and 6G: Challenges, Architecture, and Applications (2021): 219
- P. Guleria and M. Sood, (2018) ‘Predictive Data Modelling: Educational Data Classification and Comparative Analysis of Classifiers Using Python,’ Fifth International Conference on Parallel, Distributed and Grid Computing (PDGC), Solan Himachal Pradesh, India, pp. 740-746. DOI: 10.1109/PDGC.2018.87457.
- R. De Croon, J. Klerkx and E. Duval, (2015) ‘Interactive proof-of-concept dashboard to explore patient follow-up in general practice,’ 9th International Conference on Pervasive Computing Technologies for Healthcare (PervasiveHealth), Istanbul, pp. 233-236, doi: 10.4108/icst.pervasivehealth.2015.258991.
- Satheesh Kumar P., Jeevitha, Manikandan (2021) Diagnosing COVID-19 Virus in the Cardiovascular System Using ANN. In: Oliva D., Hassan S.A., Mohamed A. (eds) Artificial Intelligence for COVID-19. Studies in Systems, Decision and Control, vol 358. Springer, Cham. https://doi.org/10.1007/978-3-030-69744-0_5.
- W. Wang and Y. Guo,(2009) ‘Air Pollution PM2.5 Data Analysis in Los Angeles Long Beach with Seasonal ARIMA Model,’ International Conference on Energy and Environment Technology, Guilin, Guangxi, pp. 7-10, doi: 10.1109/ICEET.2009.468.
- W. Wang and Z. Niu, (2009) ‘Time Series Analysis of NASDAQ Composite Based on Seasonal ARIMA Model,’ International Conference on Management and Service Science, Wuhan, pp. 1-4, doi: 10.1109/ICMSS.2009.5300866.
- Yan-ming Yang, Hui Yu and Zhi Sun, (2017) ‘Aircraft failure rate forecasting method based on Holt-Winters seasonal model,’ IEEE 2nd International Conference on Cloud Computing and Big Data Analysis (ICCCBDA), Chengdu, pp. 520-524, doi: 10.1109/ICCCBDA.2017.7951969.
