Fast Text Classification with Naive Bayes Method on Apache Spark

dc.authoridOGUL, Iskender Ulgen/0000-0003-4882-5266
dc.contributor.authorOgul, Iskender Ulgen
dc.contributor.authorOzcan, Caner
dc.contributor.authorHakdagli, Ozlem
dc.date.accessioned2024-09-29T16:11:35Z
dc.date.available2024-09-29T16:11:35Z
dc.date.issued2017
dc.departmentKarabük Üniversitesien_US
dc.description25th Signal Processing and Communications Applications Conference (SIU) -- MAY 15-18, 2017 -- Antalya, TURKEYen_US
dc.description.abstractThe increase in the number of devices and users online with the transition of Internet of Things (IoT), increases the amount of large data exponentially. Classification of ascending data, deletion of irrelevant data, and meaning extraction have reached vital importance in today's standards. Analysis can be done in various variations such as Classification of text on text data, analysis of spam, personality analysis. In this study, fast text classification was performed with machine learning on Apache Spark using the Naive Bayes method. Spark architecture uses a distributed in-memory data collection instead of a distributed data structure presented in Hadoop architecture to provide fast storage and analysis of data. Analyzes were made on the interpretation data of the Reddit which is open source social news site by using the Naive Bayes method. The results are presented in tables and graphsen_US
dc.description.sponsorshipTurk Telekom,Arcelik A S,Aselsan,ARGENIT,HAVELSAN,NETAS,Adresgezgini,IEEE Turkey Sect,AVCR Informat Technologies,Cisco,i2i Syst,Integrated Syst & Syst Design,ENOVAS,FiGES Engn,MS Spektral,Istanbul Teknik Univen_US
dc.identifier.isbn978-1-5090-6494-6
dc.identifier.issn2165-0608
dc.identifier.urihttps://hdl.handle.net/20.500.14619/8538
dc.identifier.wosWOS:000413813100584en_US
dc.identifier.wosqualityN/Aen_US
dc.indekslendigikaynakWeb of Scienceen_US
dc.language.isotren_US
dc.publisherIeeeen_US
dc.relation.ispartof2017 25th Signal Processing and Communications Applications Conference (Siu)en_US
dc.relation.publicationcategoryKonferans Öğesi - Uluslararası - Kurum Öğretim Elemanıen_US
dc.rightsinfo:eu-repo/semantics/closedAccessen_US
dc.subjectMachine learningen_US
dc.subjectText miningen_US
dc.subjectBig dataen_US
dc.subjectApache Sparken_US
dc.subjectClassificationen_US
dc.subjectNaive Bayesen_US
dc.titleFast Text Classification with Naive Bayes Method on Apache Sparken_US
dc.typeConference Objecten_US

Dosyalar