Military Institute of Science and Technology
MIST Digital Archive

Logical analysis of built-in DBSCAN Functions in Popular Data Science Programming Languages

dc.contributor.authorAmiruzzaman, Md
dc.contributor.authorRahman, Rashik
dc.contributor.authorIslam, Md. Rajibul
dc.contributor.authorMohd Nor, Rizal
dc.date.accessioned2023-01-22T05:29:54Z
dc.date.available2023-01-22T05:29:54Z
dc.date.issued2022-06
dc.description.abstractDBSCAN algorithm is a location-based clustering approach; it is used to find relationships and patterns in geographical data. Because of its widespread application, several data science-based programming languages include the DBSCAN method as a built-in function. Researchers and data scientists have been clustering and analyzing their study data using the built-in DBSCAN functions. All implementations of the DBSCAN functions require user input for radius distance (i.e., eps) and a minimum number of samples for a cluster (i.e., min_sample). As a result, the result of all built-in DBSCAN functions is believed to be the same. However, the DBSCAN Python built-in function yields different results than the other programming languages those are analyzed in this study. We propose a scientific way to assess the results of DBSCAN built-in function, as well as output inconsistencies. This study reveals various differences and advises caution when working with built-in functionality.en_US
dc.identifier.issn2224-2007
dc.identifier.urihttp://dspace.mist.ac.bd:8080/xmlui/handle/123456789/740
dc.language.isoenen_US
dc.publisherResearch and Development Wing, MISTen_US
dc.subjectClustering, DBSCAN, Geo-coordinates, Machine learning, Spatialen_US
dc.titleLogical analysis of built-in DBSCAN Functions in Popular Data Science Programming Languagesen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
349-Final Manuscript (for correction)-2251-3-10-20220615.pdf
Size:
627.38 KB
Format:
Adobe Portable Document Format
Description:
Logical analysis of built-in DBSCAN Functions in Popular Data Science Programming Languages

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: