Loading…
Analyzing potential tourist behavior using PCA and modified affinity propagation clustering based on Baidu index: taking Beijing city as an example
In recent years, when planning and determining a travel destination, residents often make the best of Internet techniques to access extensive travel information. Search engines undeniably reveal visitors' real-time preferences when planning to visit a destination. More and more researchers have...
Saved in:
Published in: | Data science and management 2021-06, Vol.2, p.12-19 |
---|---|
Main Authors: | , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c3211-39527b0bb4177ba96e4d1d79437f5f6c89036fb8399b0e6b2a41727032680db93 |
---|---|
cites | cdi_FETCH-LOGICAL-c3211-39527b0bb4177ba96e4d1d79437f5f6c89036fb8399b0e6b2a41727032680db93 |
container_end_page | 19 |
container_issue | |
container_start_page | 12 |
container_title | Data science and management |
container_volume | 2 |
creator | Wang, Lin Wang, Sirui Yuan, Zhe Peng, Lu |
description | In recent years, when planning and determining a travel destination, residents often make the best of Internet techniques to access extensive travel information. Search engines undeniably reveal visitors' real-time preferences when planning to visit a destination. More and more researchers have adopted tourism-related search engine data in the field of tourism prediction. However, few studies use search engine data to conduct cluster analysis to identify residents' choice toward a tourism destination. In the present study, 146 keywords related to “Beijing tourism” are obtained from Baidu index and principal component analysis (PCA) is applied to reduce the dimensionality of keywords obtained by Baidu index. Modified affinity propagation (MAP) clustering algorithm is used to classify provinces into several groups to identify the choice of residents to travel to Beijing. The result shows that residents in Hebei province are most likely to travel to Beijing. The cluster result also shows that PCA–MAP performs better than other clustering methods such as K-means, linkage, and Affinity Propogation (AP) in terms of silhouette coefficient and Calinski–Harabaz index. We also distinguish the difference of residents’ choice to travel to Beijing during the peak tourist season and off-season. The residents of Tianjing are inclined to travel to Beijing during the peak tourist season. The residents of Guangdong, Hebei, Henan, Jiangsu, Liaoning, Shanghai, Shandong, and Zhejiang have high attention to travel to Beijing during both seasons. |
doi_str_mv | 10.1016/j.dsm.2021.05.001 |
format | article |
fullrecord | <record><control><sourceid>elsevier_doaj_</sourceid><recordid>TN_cdi_doaj_primary_oai_doaj_org_article_3e9fef6d7ec9449aa2756aec81d8e69b</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S2666764921000138</els_id><doaj_id>oai_doaj_org_article_3e9fef6d7ec9449aa2756aec81d8e69b</doaj_id><sourcerecordid>S2666764921000138</sourcerecordid><originalsourceid>FETCH-LOGICAL-c3211-39527b0bb4177ba96e4d1d79437f5f6c89036fb8399b0e6b2a41727032680db93</originalsourceid><addsrcrecordid>eNp9kcFu1DAQhqOKSq1KH6A3v8AG20mcGE7bFZRKleAAZ2tsjxeHbBzZ3qrLa_DCOCxCnDjNaOb_P83or6o7RmtGmXgz1jYdak45q2lXU8ouqmsuhNj0opWv_umvqtuURkopHxjjnbiufm5nmE4__LwnS8g4Zw8TyeEYfcpE4zd49iGSY1oFn3dbArMlh2C982gJOOdnn09kiWGBPWQfZmKmY8oYV4OGVFRldg_eHomfLb68JRm-r8t79ONazQqAVMgEX-CwTPi6unQwJbz9U2-qrx_ef9l93Dx9enjcbZ82puGMbRrZ8V5TrVvW9xqkwNYy28u26V3nhBkkbYTTQyOlpig0hyLkPW24GKjVsrmpHs9cG2BUS_QHiCcVwKvfgxD3CmL2ZkLVoHTohO3RyLaVALzvBKAZmB1QSF1Y7MwyMaQU0f3lMarWkNSoSkhqDUnRTpWQiufd2YPlyWePUSXjcTZofUSTyxX-P-5fa_OcBg</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Analyzing potential tourist behavior using PCA and modified affinity propagation clustering based on Baidu index: taking Beijing city as an example</title><source>ScienceDirect Journals</source><creator>Wang, Lin ; Wang, Sirui ; Yuan, Zhe ; Peng, Lu</creator><creatorcontrib>Wang, Lin ; Wang, Sirui ; Yuan, Zhe ; Peng, Lu</creatorcontrib><description>In recent years, when planning and determining a travel destination, residents often make the best of Internet techniques to access extensive travel information. Search engines undeniably reveal visitors' real-time preferences when planning to visit a destination. More and more researchers have adopted tourism-related search engine data in the field of tourism prediction. However, few studies use search engine data to conduct cluster analysis to identify residents' choice toward a tourism destination. In the present study, 146 keywords related to “Beijing tourism” are obtained from Baidu index and principal component analysis (PCA) is applied to reduce the dimensionality of keywords obtained by Baidu index. Modified affinity propagation (MAP) clustering algorithm is used to classify provinces into several groups to identify the choice of residents to travel to Beijing. The result shows that residents in Hebei province are most likely to travel to Beijing. The cluster result also shows that PCA–MAP performs better than other clustering methods such as K-means, linkage, and Affinity Propogation (AP) in terms of silhouette coefficient and Calinski–Harabaz index. We also distinguish the difference of residents’ choice to travel to Beijing during the peak tourist season and off-season. The residents of Tianjing are inclined to travel to Beijing during the peak tourist season. The residents of Guangdong, Hebei, Henan, Jiangsu, Liaoning, Shanghai, Shandong, and Zhejiang have high attention to travel to Beijing during both seasons.</description><identifier>ISSN: 2666-7649</identifier><identifier>EISSN: 2666-7649</identifier><identifier>DOI: 10.1016/j.dsm.2021.05.001</identifier><language>eng</language><publisher>Elsevier B.V</publisher><subject>Affinity propagation ; Baidu index data ; Cluster analysis ; Principal component analysis (PCA)</subject><ispartof>Data science and management, 2021-06, Vol.2, p.12-19</ispartof><rights>2021 The Authors</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c3211-39527b0bb4177ba96e4d1d79437f5f6c89036fb8399b0e6b2a41727032680db93</citedby><cites>FETCH-LOGICAL-c3211-39527b0bb4177ba96e4d1d79437f5f6c89036fb8399b0e6b2a41727032680db93</cites><orcidid>0000-0003-0881-9689 ; 0000-0001-9054-1094 ; 0000-0002-7849-1490</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S2666764921000138$$EHTML$$P50$$Gelsevier$$Hfree_for_read</linktohtml><link.rule.ids>314,776,780,3536,27900,27901,45755</link.rule.ids></links><search><creatorcontrib>Wang, Lin</creatorcontrib><creatorcontrib>Wang, Sirui</creatorcontrib><creatorcontrib>Yuan, Zhe</creatorcontrib><creatorcontrib>Peng, Lu</creatorcontrib><title>Analyzing potential tourist behavior using PCA and modified affinity propagation clustering based on Baidu index: taking Beijing city as an example</title><title>Data science and management</title><description>In recent years, when planning and determining a travel destination, residents often make the best of Internet techniques to access extensive travel information. Search engines undeniably reveal visitors' real-time preferences when planning to visit a destination. More and more researchers have adopted tourism-related search engine data in the field of tourism prediction. However, few studies use search engine data to conduct cluster analysis to identify residents' choice toward a tourism destination. In the present study, 146 keywords related to “Beijing tourism” are obtained from Baidu index and principal component analysis (PCA) is applied to reduce the dimensionality of keywords obtained by Baidu index. Modified affinity propagation (MAP) clustering algorithm is used to classify provinces into several groups to identify the choice of residents to travel to Beijing. The result shows that residents in Hebei province are most likely to travel to Beijing. The cluster result also shows that PCA–MAP performs better than other clustering methods such as K-means, linkage, and Affinity Propogation (AP) in terms of silhouette coefficient and Calinski–Harabaz index. We also distinguish the difference of residents’ choice to travel to Beijing during the peak tourist season and off-season. The residents of Tianjing are inclined to travel to Beijing during the peak tourist season. The residents of Guangdong, Hebei, Henan, Jiangsu, Liaoning, Shanghai, Shandong, and Zhejiang have high attention to travel to Beijing during both seasons.</description><subject>Affinity propagation</subject><subject>Baidu index data</subject><subject>Cluster analysis</subject><subject>Principal component analysis (PCA)</subject><issn>2666-7649</issn><issn>2666-7649</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>DOA</sourceid><recordid>eNp9kcFu1DAQhqOKSq1KH6A3v8AG20mcGE7bFZRKleAAZ2tsjxeHbBzZ3qrLa_DCOCxCnDjNaOb_P83or6o7RmtGmXgz1jYdak45q2lXU8ouqmsuhNj0opWv_umvqtuURkopHxjjnbiufm5nmE4__LwnS8g4Zw8TyeEYfcpE4zd49iGSY1oFn3dbArMlh2C982gJOOdnn09kiWGBPWQfZmKmY8oYV4OGVFRldg_eHomfLb68JRm-r8t79ONazQqAVMgEX-CwTPi6unQwJbz9U2-qrx_ef9l93Dx9enjcbZ82puGMbRrZ8V5TrVvW9xqkwNYy28u26V3nhBkkbYTTQyOlpig0hyLkPW24GKjVsrmpHs9cG2BUS_QHiCcVwKvfgxD3CmL2ZkLVoHTohO3RyLaVALzvBKAZmB1QSF1Y7MwyMaQU0f3lMarWkNSoSkhqDUnRTpWQiufd2YPlyWePUSXjcTZofUSTyxX-P-5fa_OcBg</recordid><startdate>202106</startdate><enddate>202106</enddate><creator>Wang, Lin</creator><creator>Wang, Sirui</creator><creator>Yuan, Zhe</creator><creator>Peng, Lu</creator><general>Elsevier B.V</general><general>KeAi Communications Co. Ltd</general><scope>6I.</scope><scope>AAFTH</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0003-0881-9689</orcidid><orcidid>https://orcid.org/0000-0001-9054-1094</orcidid><orcidid>https://orcid.org/0000-0002-7849-1490</orcidid></search><sort><creationdate>202106</creationdate><title>Analyzing potential tourist behavior using PCA and modified affinity propagation clustering based on Baidu index: taking Beijing city as an example</title><author>Wang, Lin ; Wang, Sirui ; Yuan, Zhe ; Peng, Lu</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c3211-39527b0bb4177ba96e4d1d79437f5f6c89036fb8399b0e6b2a41727032680db93</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Affinity propagation</topic><topic>Baidu index data</topic><topic>Cluster analysis</topic><topic>Principal component analysis (PCA)</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Wang, Lin</creatorcontrib><creatorcontrib>Wang, Sirui</creatorcontrib><creatorcontrib>Yuan, Zhe</creatorcontrib><creatorcontrib>Peng, Lu</creatorcontrib><collection>ScienceDirect Open Access Titles</collection><collection>Elsevier:ScienceDirect:Open Access</collection><collection>CrossRef</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>Data science and management</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Wang, Lin</au><au>Wang, Sirui</au><au>Yuan, Zhe</au><au>Peng, Lu</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Analyzing potential tourist behavior using PCA and modified affinity propagation clustering based on Baidu index: taking Beijing city as an example</atitle><jtitle>Data science and management</jtitle><date>2021-06</date><risdate>2021</risdate><volume>2</volume><spage>12</spage><epage>19</epage><pages>12-19</pages><issn>2666-7649</issn><eissn>2666-7649</eissn><abstract>In recent years, when planning and determining a travel destination, residents often make the best of Internet techniques to access extensive travel information. Search engines undeniably reveal visitors' real-time preferences when planning to visit a destination. More and more researchers have adopted tourism-related search engine data in the field of tourism prediction. However, few studies use search engine data to conduct cluster analysis to identify residents' choice toward a tourism destination. In the present study, 146 keywords related to “Beijing tourism” are obtained from Baidu index and principal component analysis (PCA) is applied to reduce the dimensionality of keywords obtained by Baidu index. Modified affinity propagation (MAP) clustering algorithm is used to classify provinces into several groups to identify the choice of residents to travel to Beijing. The result shows that residents in Hebei province are most likely to travel to Beijing. The cluster result also shows that PCA–MAP performs better than other clustering methods such as K-means, linkage, and Affinity Propogation (AP) in terms of silhouette coefficient and Calinski–Harabaz index. We also distinguish the difference of residents’ choice to travel to Beijing during the peak tourist season and off-season. The residents of Tianjing are inclined to travel to Beijing during the peak tourist season. The residents of Guangdong, Hebei, Henan, Jiangsu, Liaoning, Shanghai, Shandong, and Zhejiang have high attention to travel to Beijing during both seasons.</abstract><pub>Elsevier B.V</pub><doi>10.1016/j.dsm.2021.05.001</doi><tpages>8</tpages><orcidid>https://orcid.org/0000-0003-0881-9689</orcidid><orcidid>https://orcid.org/0000-0001-9054-1094</orcidid><orcidid>https://orcid.org/0000-0002-7849-1490</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2666-7649 |
ispartof | Data science and management, 2021-06, Vol.2, p.12-19 |
issn | 2666-7649 2666-7649 |
language | eng |
recordid | cdi_doaj_primary_oai_doaj_org_article_3e9fef6d7ec9449aa2756aec81d8e69b |
source | ScienceDirect Journals |
subjects | Affinity propagation Baidu index data Cluster analysis Principal component analysis (PCA) |
title | Analyzing potential tourist behavior using PCA and modified affinity propagation clustering based on Baidu index: taking Beijing city as an example |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-25T12%3A23%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-elsevier_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Analyzing%20potential%20tourist%20behavior%20using%20PCA%20and%20modified%20affinity%20propagation%20clustering%20based%20on%20Baidu%20index:%20taking%20Beijing%20city%20as%20an%20example&rft.jtitle=Data%20science%20and%20management&rft.au=Wang,%20Lin&rft.date=2021-06&rft.volume=2&rft.spage=12&rft.epage=19&rft.pages=12-19&rft.issn=2666-7649&rft.eissn=2666-7649&rft_id=info:doi/10.1016/j.dsm.2021.05.001&rft_dat=%3Celsevier_doaj_%3ES2666764921000138%3C/elsevier_doaj_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c3211-39527b0bb4177ba96e4d1d79437f5f6c89036fb8399b0e6b2a41727032680db93%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |