Healthcare Data Mining and Cleansing: A Study of Improving Data Quality for Effective Data Analysis on NPPES
Pérez Medina, Carlos A.
MetadataShow full item record
This article examines the use of data mining in the healthcare industry, with a particular emphasis on best practices for increasing data quality, preserving provider information, and applying advanced techniques to extract valuable insights from complicated data sets. The research gathered information from the National Plan and Provider Enumeration System (NPPES) and then analyzed the data to determine whether or not there were any issues with the information. The information was sorted into its two basic groups, which were establishments and service providers. The headers were modified accordingly, the information was standardized by the application of analysis and processing, and any null values have been removed. In the context of data utilization on healthcare provision across the nation, questions of ethics, including the protection of individuals' right to privacy and the confidentiality of health information, were discussed and highlighted as critical components. Key terms: Data Assurance, Data Mining, Healthcare, National Provider Identifier.