Loading…

Analyzing potential tourist behavior using PCA and modified affinity propagation clustering based on Baidu index: taking Beijing city as an example

In recent years, when planning and determining a travel destination, residents often make the best of Internet techniques to access extensive travel information. Search engines undeniably reveal visitors' real-time preferences when planning to visit a destination. More and more researchers have...

Full description

Saved in:
Bibliographic Details
Published in:Data science and management 2021-06, Vol.2, p.12-19
Main Authors: Wang, Lin, Wang, Sirui, Yuan, Zhe, Peng, Lu
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c3211-39527b0bb4177ba96e4d1d79437f5f6c89036fb8399b0e6b2a41727032680db93
cites cdi_FETCH-LOGICAL-c3211-39527b0bb4177ba96e4d1d79437f5f6c89036fb8399b0e6b2a41727032680db93
container_end_page 19
container_issue
container_start_page 12
container_title Data science and management
container_volume 2
creator Wang, Lin
Wang, Sirui
Yuan, Zhe
Peng, Lu
description In recent years, when planning and determining a travel destination, residents often make the best of Internet techniques to access extensive travel information. Search engines undeniably reveal visitors' real-time preferences when planning to visit a destination. More and more researchers have adopted tourism-related search engine data in the field of tourism prediction. However, few studies use search engine data to conduct cluster analysis to identify residents' choice toward a tourism destination. In the present study, 146 keywords related to “Beijing tourism” are obtained from Baidu index and principal component analysis (PCA) is applied to reduce the dimensionality of keywords obtained by Baidu index. Modified affinity propagation (MAP) clustering algorithm is used to classify provinces into several groups to identify the choice of residents to travel to Beijing. The result shows that residents in Hebei province are most likely to travel to Beijing. The cluster result also shows that PCA–MAP performs better than other clustering methods such as K-means, linkage, and Affinity Propogation (AP) in terms of silhouette coefficient and Calinski–Harabaz index. We also distinguish the difference of residents’ choice to travel to Beijing during the peak tourist season and off-season. The residents of Tianjing are inclined to travel to Beijing during the peak tourist season. The residents of Guangdong, Hebei, Henan, Jiangsu, Liaoning, Shanghai, Shandong, and Zhejiang have high attention to travel to Beijing during both seasons.
doi_str_mv 10.1016/j.dsm.2021.05.001
format article
fullrecord <record><control><sourceid>elsevier_doaj_</sourceid><recordid>TN_cdi_doaj_primary_oai_doaj_org_article_3e9fef6d7ec9449aa2756aec81d8e69b</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S2666764921000138</els_id><doaj_id>oai_doaj_org_article_3e9fef6d7ec9449aa2756aec81d8e69b</doaj_id><sourcerecordid>S2666764921000138</sourcerecordid><originalsourceid>FETCH-LOGICAL-c3211-39527b0bb4177ba96e4d1d79437f5f6c89036fb8399b0e6b2a41727032680db93</originalsourceid><addsrcrecordid>eNp9kcFu1DAQhqOKSq1KH6A3v8AG20mcGE7bFZRKleAAZ2tsjxeHbBzZ3qrLa_DCOCxCnDjNaOb_P83or6o7RmtGmXgz1jYdak45q2lXU8ouqmsuhNj0opWv_umvqtuURkopHxjjnbiufm5nmE4__LwnS8g4Zw8TyeEYfcpE4zd49iGSY1oFn3dbArMlh2C982gJOOdnn09kiWGBPWQfZmKmY8oYV4OGVFRldg_eHomfLb68JRm-r8t79ONazQqAVMgEX-CwTPi6unQwJbz9U2-qrx_ef9l93Dx9enjcbZ82puGMbRrZ8V5TrVvW9xqkwNYy28u26V3nhBkkbYTTQyOlpig0hyLkPW24GKjVsrmpHs9cG2BUS_QHiCcVwKvfgxD3CmL2ZkLVoHTohO3RyLaVALzvBKAZmB1QSF1Y7MwyMaQU0f3lMarWkNSoSkhqDUnRTpWQiufd2YPlyWePUSXjcTZofUSTyxX-P-5fa_OcBg</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Analyzing potential tourist behavior using PCA and modified affinity propagation clustering based on Baidu index: taking Beijing city as an example</title><source>ScienceDirect Journals</source><creator>Wang, Lin ; Wang, Sirui ; Yuan, Zhe ; Peng, Lu</creator><creatorcontrib>Wang, Lin ; Wang, Sirui ; Yuan, Zhe ; Peng, Lu</creatorcontrib><description>In recent years, when planning and determining a travel destination, residents often make the best of Internet techniques to access extensive travel information. Search engines undeniably reveal visitors' real-time preferences when planning to visit a destination. More and more researchers have adopted tourism-related search engine data in the field of tourism prediction. However, few studies use search engine data to conduct cluster analysis to identify residents' choice toward a tourism destination. In the present study, 146 keywords related to “Beijing tourism” are obtained from Baidu index and principal component analysis (PCA) is applied to reduce the dimensionality of keywords obtained by Baidu index. Modified affinity propagation (MAP) clustering algorithm is used to classify provinces into several groups to identify the choice of residents to travel to Beijing. The result shows that residents in Hebei province are most likely to travel to Beijing. The cluster result also shows that PCA–MAP performs better than other clustering methods such as K-means, linkage, and Affinity Propogation (AP) in terms of silhouette coefficient and Calinski–Harabaz index. We also distinguish the difference of residents’ choice to travel to Beijing during the peak tourist season and off-season. The residents of Tianjing are inclined to travel to Beijing during the peak tourist season. The residents of Guangdong, Hebei, Henan, Jiangsu, Liaoning, Shanghai, Shandong, and Zhejiang have high attention to travel to Beijing during both seasons.</description><identifier>ISSN: 2666-7649</identifier><identifier>EISSN: 2666-7649</identifier><identifier>DOI: 10.1016/j.dsm.2021.05.001</identifier><language>eng</language><publisher>Elsevier B.V</publisher><subject>Affinity propagation ; Baidu index data ; Cluster analysis ; Principal component analysis (PCA)</subject><ispartof>Data science and management, 2021-06, Vol.2, p.12-19</ispartof><rights>2021 The Authors</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c3211-39527b0bb4177ba96e4d1d79437f5f6c89036fb8399b0e6b2a41727032680db93</citedby><cites>FETCH-LOGICAL-c3211-39527b0bb4177ba96e4d1d79437f5f6c89036fb8399b0e6b2a41727032680db93</cites><orcidid>0000-0003-0881-9689 ; 0000-0001-9054-1094 ; 0000-0002-7849-1490</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S2666764921000138$$EHTML$$P50$$Gelsevier$$Hfree_for_read</linktohtml><link.rule.ids>314,776,780,3536,27900,27901,45755</link.rule.ids></links><search><creatorcontrib>Wang, Lin</creatorcontrib><creatorcontrib>Wang, Sirui</creatorcontrib><creatorcontrib>Yuan, Zhe</creatorcontrib><creatorcontrib>Peng, Lu</creatorcontrib><title>Analyzing potential tourist behavior using PCA and modified affinity propagation clustering based on Baidu index: taking Beijing city as an example</title><title>Data science and management</title><description>In recent years, when planning and determining a travel destination, residents often make the best of Internet techniques to access extensive travel information. Search engines undeniably reveal visitors' real-time preferences when planning to visit a destination. More and more researchers have adopted tourism-related search engine data in the field of tourism prediction. However, few studies use search engine data to conduct cluster analysis to identify residents' choice toward a tourism destination. In the present study, 146 keywords related to “Beijing tourism” are obtained from Baidu index and principal component analysis (PCA) is applied to reduce the dimensionality of keywords obtained by Baidu index. Modified affinity propagation (MAP) clustering algorithm is used to classify provinces into several groups to identify the choice of residents to travel to Beijing. The result shows that residents in Hebei province are most likely to travel to Beijing. The cluster result also shows that PCA–MAP performs better than other clustering methods such as K-means, linkage, and Affinity Propogation (AP) in terms of silhouette coefficient and Calinski–Harabaz index. We also distinguish the difference of residents’ choice to travel to Beijing during the peak tourist season and off-season. The residents of Tianjing are inclined to travel to Beijing during the peak tourist season. The residents of Guangdong, Hebei, Henan, Jiangsu, Liaoning, Shanghai, Shandong, and Zhejiang have high attention to travel to Beijing during both seasons.</description><subject>Affinity propagation</subject><subject>Baidu index data</subject><subject>Cluster analysis</subject><subject>Principal component analysis (PCA)</subject><issn>2666-7649</issn><issn>2666-7649</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>DOA</sourceid><recordid>eNp9kcFu1DAQhqOKSq1KH6A3v8AG20mcGE7bFZRKleAAZ2tsjxeHbBzZ3qrLa_DCOCxCnDjNaOb_P83or6o7RmtGmXgz1jYdak45q2lXU8ouqmsuhNj0opWv_umvqtuURkopHxjjnbiufm5nmE4__LwnS8g4Zw8TyeEYfcpE4zd49iGSY1oFn3dbArMlh2C982gJOOdnn09kiWGBPWQfZmKmY8oYV4OGVFRldg_eHomfLb68JRm-r8t79ONazQqAVMgEX-CwTPi6unQwJbz9U2-qrx_ef9l93Dx9enjcbZ82puGMbRrZ8V5TrVvW9xqkwNYy28u26V3nhBkkbYTTQyOlpig0hyLkPW24GKjVsrmpHs9cG2BUS_QHiCcVwKvfgxD3CmL2ZkLVoHTohO3RyLaVALzvBKAZmB1QSF1Y7MwyMaQU0f3lMarWkNSoSkhqDUnRTpWQiufd2YPlyWePUSXjcTZofUSTyxX-P-5fa_OcBg</recordid><startdate>202106</startdate><enddate>202106</enddate><creator>Wang, Lin</creator><creator>Wang, Sirui</creator><creator>Yuan, Zhe</creator><creator>Peng, Lu</creator><general>Elsevier B.V</general><general>KeAi Communications Co. Ltd</general><scope>6I.</scope><scope>AAFTH</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0003-0881-9689</orcidid><orcidid>https://orcid.org/0000-0001-9054-1094</orcidid><orcidid>https://orcid.org/0000-0002-7849-1490</orcidid></search><sort><creationdate>202106</creationdate><title>Analyzing potential tourist behavior using PCA and modified affinity propagation clustering based on Baidu index: taking Beijing city as an example</title><author>Wang, Lin ; Wang, Sirui ; Yuan, Zhe ; Peng, Lu</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c3211-39527b0bb4177ba96e4d1d79437f5f6c89036fb8399b0e6b2a41727032680db93</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Affinity propagation</topic><topic>Baidu index data</topic><topic>Cluster analysis</topic><topic>Principal component analysis (PCA)</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Wang, Lin</creatorcontrib><creatorcontrib>Wang, Sirui</creatorcontrib><creatorcontrib>Yuan, Zhe</creatorcontrib><creatorcontrib>Peng, Lu</creatorcontrib><collection>ScienceDirect Open Access Titles</collection><collection>Elsevier:ScienceDirect:Open Access</collection><collection>CrossRef</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>Data science and management</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Wang, Lin</au><au>Wang, Sirui</au><au>Yuan, Zhe</au><au>Peng, Lu</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Analyzing potential tourist behavior using PCA and modified affinity propagation clustering based on Baidu index: taking Beijing city as an example</atitle><jtitle>Data science and management</jtitle><date>2021-06</date><risdate>2021</risdate><volume>2</volume><spage>12</spage><epage>19</epage><pages>12-19</pages><issn>2666-7649</issn><eissn>2666-7649</eissn><abstract>In recent years, when planning and determining a travel destination, residents often make the best of Internet techniques to access extensive travel information. Search engines undeniably reveal visitors' real-time preferences when planning to visit a destination. More and more researchers have adopted tourism-related search engine data in the field of tourism prediction. However, few studies use search engine data to conduct cluster analysis to identify residents' choice toward a tourism destination. In the present study, 146 keywords related to “Beijing tourism” are obtained from Baidu index and principal component analysis (PCA) is applied to reduce the dimensionality of keywords obtained by Baidu index. Modified affinity propagation (MAP) clustering algorithm is used to classify provinces into several groups to identify the choice of residents to travel to Beijing. The result shows that residents in Hebei province are most likely to travel to Beijing. The cluster result also shows that PCA–MAP performs better than other clustering methods such as K-means, linkage, and Affinity Propogation (AP) in terms of silhouette coefficient and Calinski–Harabaz index. We also distinguish the difference of residents’ choice to travel to Beijing during the peak tourist season and off-season. The residents of Tianjing are inclined to travel to Beijing during the peak tourist season. The residents of Guangdong, Hebei, Henan, Jiangsu, Liaoning, Shanghai, Shandong, and Zhejiang have high attention to travel to Beijing during both seasons.</abstract><pub>Elsevier B.V</pub><doi>10.1016/j.dsm.2021.05.001</doi><tpages>8</tpages><orcidid>https://orcid.org/0000-0003-0881-9689</orcidid><orcidid>https://orcid.org/0000-0001-9054-1094</orcidid><orcidid>https://orcid.org/0000-0002-7849-1490</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2666-7649
ispartof Data science and management, 2021-06, Vol.2, p.12-19
issn 2666-7649
2666-7649
language eng
recordid cdi_doaj_primary_oai_doaj_org_article_3e9fef6d7ec9449aa2756aec81d8e69b
source ScienceDirect Journals
subjects Affinity propagation
Baidu index data
Cluster analysis
Principal component analysis (PCA)
title Analyzing potential tourist behavior using PCA and modified affinity propagation clustering based on Baidu index: taking Beijing city as an example
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-25T12%3A23%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-elsevier_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Analyzing%20potential%20tourist%20behavior%20using%20PCA%20and%20modified%20affinity%20propagation%20clustering%20based%20on%20Baidu%20index:%20taking%20Beijing%20city%20as%20an%20example&rft.jtitle=Data%20science%20and%20management&rft.au=Wang,%20Lin&rft.date=2021-06&rft.volume=2&rft.spage=12&rft.epage=19&rft.pages=12-19&rft.issn=2666-7649&rft.eissn=2666-7649&rft_id=info:doi/10.1016/j.dsm.2021.05.001&rft_dat=%3Celsevier_doaj_%3ES2666764921000138%3C/elsevier_doaj_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c3211-39527b0bb4177ba96e4d1d79437f5f6c89036fb8399b0e6b2a41727032680db93%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true