Fuel Sales Forecasting with SARIMA-GARCH and Rolling Window
Volume-5 | Issue-3

An Accurate Bitcoin Price Prediction using logistic regression with LSTM Machine Learning model
Volume-3 | Issue-3

Nepali Image Captioning: Generating Coherent Paragraph-Length Descriptions Using Transformer
Volume-6 | Issue-1

A Comprehensive Review on Advanced Driver Assistance System
Volume-4 | Issue-2

A Novel Approach based on PSO and Coloured Petri Net for improving Services in the Emergency Department
Volume-5 | Issue-1

Credit Risk Analysis using Explainable Artificial Intelligence
Volume-6 | Issue-3

Implications of Tokenizers in BERT Model for Low-Resource Indian Language
Volume-4 | Issue-4

Design of Distribution Transformer Health Management System using IoT Sensors
Volume-3 | Issue-3

Energy Management System in the Vehicles using Three Level Neuro Fuzzy Logic
Volume-3 | Issue-3

Cloud Load Estimation with Deep Logarithmic Network for Workload and Time Series Optimization
Volume-3 | Issue-3

An Integrated Approach for Crop Production Analysis from Geographic Information System Data using SqueezeNet
Volume-3 | Issue-4

An Accurate Bitcoin Price Prediction using logistic regression with LSTM Machine Learning model
Volume-3 | Issue-3

Design of Distribution Transformer Health Management System using IoT Sensors
Volume-3 | Issue-3

Design of a Food Recommendation System using ADNet algorithm on a Hybrid Data Mining Process
Volume-3 | Issue-4

Automatic Diagnosis of Alzheimer’s disease using Hybrid Model and CNN
Volume-3 | Issue-4

Effective Prediction of Online Reviews for Improvement of Customer Recommendation Services by Hybrid Classification Approach
Volume-3 | Issue-4

Acoustic Features Based Emotional Speech Signal Categorization by Advanced Linear Discriminator Analysis
Volume-3 | Issue-4

Analysis of Statistical Trends of Future Air Pollutants for Accurate Prediction
Volume-3 | Issue-4

Identification of Electricity Threat and Performance Analysis using LSTM and RUSBoost Methodology
Volume-3 | Issue-4

Review on Data Securing Techniques for Internet of Medical Things
Volume-3 | Issue-3

Home / Archives / Volume-4 / Issue-3 / Article-2

Volume - 4 | Issue - 3 | september 2022

Analysis of AI based Data Wrangling Methods in Intelligent Knowledge Lakes Open Access
D. Sasikala  , K. Venkatesh Sharma  269
Pages: 129-140
Cite this article
Sasikala, D., and K. Venkatesh Sharma. "Analysis of AI based Data Wrangling Methods in Intelligent Knowledge Lakes." Journal of Soft Computing Paradigm 4, no. 3 (2022): 129-140
DOI
10.36548/jscp.2022.3.002
Published
30 August, 2022
Abstract

A novel conception of Knowledge Lake, i.e., a Contextualized Data Lake is to be soundly educated. The deliberated big-data practices pave a means for the erection of Intelligent Knowledge Lakes and that being the resources for big-data applications and analytics. This analysis likewise opens the welfares, disputes, and exploration prospects of Intelligent Knowledge Lakes. Data Science is launched as an influential discernment through businesses. Organizations today are dedicated on transforming their facts into ultra-practical intuitions. This work is challenging, as in present day’s intelligence, amenity and cloud customary budget trades accumulate immense aggregates of unprocessed data after a variety of funds. Data Lakes are familiar as a packing depository that fetch concurrently the unprocessed data in its innate set-up (sustaining to NoSQL from relational databases) which is crucial. The logic behind Data Lake is to stockpile unprocessed data and let the data analyst resolve the way to curate them well ahead of reviewing the idea of Knowledge Lake, which is an anecdotal Data Lake. The Intelligent Knowledge Lake stipulate the basis for big data analytics by robotically curating the unprocessed data in the Data Lake grooming these for stemming intuitions via programmed interactive real-time optimized data wrangling in intelligent knowledge lakes. Computerization of an exposed free public Data and Knowledge Lake amenity provides developers and researchers a distinct REST API to systematize, curate, catalog and interrogate their data and metadata in the Lake for a longer time. It administers manifold database/databank know-hows (from Relational to NoSQL) that deals with an inherent scheme for data security, curation, and provenance.

Keywords

Data wrangling data munging artificial intelligence data lakes knowledge lakes express analytics optimization

×

Currently, subscription is the only source of revenue. The subscription resource covers the operating expenses such as web presence, online version, pre-press preparations, and staff wages.

To access the full PDF, please complete the payment process.

Subscription Details

Category Fee
Article Access Charge
15 USD
Open Access Fee Nil
Annual Subscription Fee
200 USD
After payment,
please send an email to irojournals.contact@gmail.com / journals@iroglobal.com requesting article access.
Subscription form: click here