Data Quality Assessment Methodology
Data Quality Assessment Methodology
Daniela Delinschi, Rudolf Erdei, Emil Pasca, Oliviu Matei
Abstract. High-quality data is a precondition for reliable machine learning, analytics and decision support. This paper introduces a methodology for systematic data quality assessment that combines dimensions such as completeness, consistency, accuracy, timeliness and validity into a unified evaluation framework. The methodology defines reusable measurement procedures, scoring schemes and aggregation rules that can be applied at the level of individual fields, datasets or entire data pipelines. By providing a structured way to detect and quantify data quality issues, the proposed approach supports continuous monitoring, helps prioritise remediation actions and improves the trustworthiness of downstream data-driven services, with a particular focus on smart agriculture data ecosystems.
Keywords: data quality; data quality dimensions; assessment methodology; data governance; smart agriculture
📋 Cite this publication
Daniela Delinschi, Rudolf Erdei, Emil Pasca, Oliviu Matei, "Data Quality Assessment Methodology", Proc. 19th SOCO Int. Conf. on Soft Computing Models in Industrial and Environmental Applications, Springer, 2024, 2023.
Reference: Proc. 19th SOCO Int. Conf. on Soft Computing Models in Industrial and Environmental Applications, Springer, 2024.
Benefits and limitations of digitalization in managing European Social funded projects
Benefits and limitations of digitalization in managing European Social funded projectsMatei...
A Novel CNN Approach for Accurate Tomato Disease Classification
A Novel CNN Approach for Accurate Tomato Disease ClassificationOvidiu Cosma, Laura Cosma Abstract....
Design of a collaborative network for mapping digital skills for Industry 5.0
Design of a collaborative network for mapping digital skills for Industry 5.0Maria Gustavsson,...
Solving the clustered minimum routing tree problem using Prüfer-coding based hybrid genetic algorithms
Solving the clustered minimum routing tree problem using Prüfer-coding based hybrid genetic...
Augmenting API Security Testing with Automated LLM-Driven Test Generation
Augmenting API Security Testing with Automated LLM-Driven Test GenerationEmil Marian Pasca, Rudolf...
Privacy Assessment Methodology for Machine Learning Models and Data Sources
Privacy Assessment Methodology for Machine Learning Models and Data SourcesRudolf Erdei, Emil...
Aggregation Strategy for Federated Machine Learning Algorithm
Aggregation Strategy for Federated Machine Learning AlgorithmRudolf Erdei, Daniela Delinschi,...
Using Markov chains for determining the proximity contagion of smart specialization of localities
Using Markov chains for determining the proximity contagion of smart specialization of...
Advancements in Machine Learning Algorithms for Precision Crop Yield Prediction: A Comprehensive Review with focus on European Union
Advancements in Machine Learning Algorithms for Precision Crop Yield Prediction: A Comprehensive...
TPC Net: An Efficient CNN Architecture for Tomato Plant Disease and Pest Classification
TPC Net: An Efficient CNN Architecture for Tomato Plant Disease and Pest ClassificationOvidiu...
Enhancing API Security Testing against BOLA and Authentication Vulnerabilities through an LLM-Enhanced Framework
Enhancing API Security Testing against BOLA and Authentication Vulnerabilities through an...
A new vision of social behavior on genetic algorithm performance
A new vision of social behavior on genetic algorithm performanceAndreea Tatar, Nicolae Fat, Adrian...













0 Comments