    • Banking Customer Data Analysis 

      Peralta Santiago, Anibal D. (Polytechnic University of Puerto Rico, 2014)
      This paper presents a data mining analysis of customer data from a bank. We apply different methodologies and test some concepts against the data. Some of the goals of this analysis are to produce results ...
    • Data Mining Techniques for the Integrated Postsecondary Data System 

      Jiménez Vargas, Wilfredo (Polytechnic University of Puerto Rico, 2018)
      The process of generating the necessary information for the Integrated Postsecondary Data System can be tedious; a standard way to collect the information from the different departments is needed. To process it, we use ...
    • Exploring Distributed Machine Learning System on Raspberry Pi Computer Cluster 

      Torres Torres, Isaac L. (Polytechnic University of Puerto Rico, 2021)
      This project explored the use of Distributed Machine Learning (DML) as a potential tool for improving the training times of Machine Learning (ML) models on lower-end computer clusters, to provide alternatives for students and scientists ...
    • Healthcare Data Mining and Cleansing: A Study of Improving Data Quality for Effective Data Analysis on NPPES 

      Pérez Medina, Carlos A. (Polytechnic University of Puerto Rico, 2022)
      This article examines the use of data mining in the healthcare industry, with a particular emphasis on best practices for increasing data quality, preserving provider information, and applying advanced techniques to extract ...
    • Image Object Recognition Using Apache Hadoop and Python 

      Del Valle Maldonado, Jaileen (Polytechnic University of Puerto Rico, 2022)
      The amount of data that people generate each day on social media platforms is increasing at an alarming rate. Studies show that approximately 1.5 billion images are uploaded to the ...
    • Implementation of Data Mining Techniques and Machine Learning Model in Manufacturing Process 

      Vélez Báez, Héctor (Polytechnic University of Puerto Rico, 2021)
      Time is critical in manufacturing processes. Many manufacturing companies rely on their operators' knowledge, experience, or random methods to adjust CNC machines. Sometimes these judgements are ineffective and limited ...
    • An Overview of Web Scraping: Technical Aspects and Exercises 

      Pérez Molano, Gustavo (Polytechnic University of Puerto Rico, 2023)
      Researchers and organizations conducting different types of research can benefit from studying web scraping and using it correctly to further their research goals. This study serves as a review of some of the web ...
    • Puerto Rico Data Collection Web Platform: Puerto Rico Stats 

      Martínez Torres, José R. (Polytechnic University of Puerto Rico, 2023)
      Currently, the importance of data gathering and usage is at its highest point and constantly growing. Companies around the world are hiring data science teams to obtain and analyze massive amounts of data in order to ...
    • Social Mining Harvester, using Twitter 

      Jové Iguina, Alberto F. (Polytechnic University of Puerto Rico, 2013)
      This article showcases phase one of a system that can analyze the Social Web. Here we present how to create a harvester for the social network Twitter. We explain the different technologies used and how to proceed ...
    • Tecnología Inteligente de Minería de Datos en Telecomunicaciones 

      Berrios Negrón, Edwin J. (Polytechnic University of Puerto Rico, 2018)
      Telecommunications companies generate an enormous amount of data. These include call detail data, which describe the calls that traverse telecommunications networks, together with the data network ...