dc.rights.license | All rights reserved | en_US |
dc.contributor.advisor | Torres Batista, Nelliud D. | |
dc.contributor.author | Del Valle Maldonado, Jaileen | |
dc.date.accessioned | 2022-07-05T19:02:57Z | |
dc.date.available | 2022-07-05T19:02:57Z | |
dc.date.issued | 2022 | |
dc.identifier.citation | Del Valle Maldonado, J. (2022). Image Object Recognition Using Apache Hadoop and Python [Unpublished manuscript]. Graduate School, Polytechnic University of Puerto Rico. | en_US |
dc.identifier.uri | http://hdl.handle.net/20.500.12475/1619 | |
dc.description | Design Project Article for the Graduate Programs at Polytechnic University of Puerto Rico | en_US |
dc.description.abstract | The amount of data generated by people each day on social media platforms is increasing at an alarming rate. Studies performed show that approximately 1.5 billion images are uploaded to the internet each day. Applications that can use and analyze this data are not available to all users due to limitations in processing power or storage space required for the analysis of these large datasets. Apache Hadoop is an open-source framework that allows distributed processing and fault tolerance of Big Data with the use of commodity hardware using Hadoop Distributed File System(HDFS)and MapReduce. Using HDFS data is stored in a distributed manner across different machines (data notes). The use of the MapReduce framework parallelized computing is available and manageable to be able to mine and analyze the image data available created by users. The focus of this article will be the analysis of image data in large datasets to create feature vectors using the k-means algorithm to group together images that contain similar objects inside them using Apache Hadoop, Map Reduce, Apache Spark, Computer Vision, and the Python programming language. Key Terms⎯K-Means Clustering, Map Reduce, Sequence File, Scale Invariant Feature Transform (SIFT) | en_US |
dc.language.iso | en | en_US |
dc.publisher | Polytechnic University of Puerto Rico | en_US |
dc.relation.ispartof | Computer Sciences; | |
dc.relation.ispartofseries | Spring-2022; | |
dc.relation.haspart | San Juan | en_US |
dc.subject.lcsh | Polytechnic University of Puerto Rico--Graduate students--Research | en_US |
dc.subject.lcsh | Polytechnic University of Puerto Rico--Graduate students--Posters | en_US |
dc.subject.lcsh | Apache Hadoop | |
dc.subject.lcsh | Big data | |
dc.subject.lcsh | Data mining | |
dc.subject.lcsh | Electronic data processing | |
dc.subject.lcsh | Python (Computer program language) | |
dc.title | Image Object Recognition Using Apache Hadoop and Python | en_US |
dc.type | Article | en_US |
dc.rights.holder | Polytechnic University of Puerto Rico, Graduate School | en_US |