Logical analysis of built-in DBSCAN Functions in Popular Data Science Programming Languages

MIST Central Library Repository

Show simple item record

dc.contributor.author Amiruzzaman, Md
dc.contributor.author Rahman, Rashik
dc.contributor.author Islam, Md. Rajibul
dc.contributor.author Mohd Nor, Rizal
dc.date.accessioned 2023-01-22T05:29:54Z
dc.date.available 2023-01-22T05:29:54Z
dc.date.issued 2022-06
dc.identifier.issn 2224-2007
dc.identifier.uri http://dspace.mist.ac.bd:8080/xmlui/handle/123456789/740
dc.description.abstract DBSCAN algorithm is a location-based clustering approach; it is used to find relationships and patterns in geographical data. Because of its widespread application, several data science-based programming languages include the DBSCAN method as a built-in function. Researchers and data scientists have been clustering and analyzing their study data using the built-in DBSCAN functions. All implementations of the DBSCAN functions require user input for radius distance (i.e., eps) and a minimum number of samples for a cluster (i.e., min_sample). As a result, the result of all built-in DBSCAN functions is believed to be the same. However, the DBSCAN Python built-in function yields different results than the other programming languages those are analyzed in this study. We propose a scientific way to assess the results of DBSCAN built-in function, as well as output inconsistencies. This study reveals various differences and advises caution when working with built-in functionality. en_US
dc.language.iso en en_US
dc.publisher Research and Development Wing, MIST en_US
dc.subject Clustering, DBSCAN, Geo-coordinates, Machine learning, Spatial en_US
dc.title Logical analysis of built-in DBSCAN Functions in Popular Data Science Programming Languages en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account