Loading…

Prediction of patent grant and interpreting the key determinants: an application of interpretable machine learning approach

Patents are valuable intellectual property only when granted by the governments, and failing to receive an official grant means disclosing valuable technologies and information, which otherwise would be kept as commercial secrets. Yet, a typical patent application process takes years to complete and...

Full description

Saved in:
Bibliographic Details
Published in:Scientometrics 2023-09, Vol.128 (9), p.4933-4969
Main Authors: Yao, Li, Ni, He
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c319t-ac33e44a7296278ad56cef68483b536a67c219a3f33b3ad634c53ea5d677a73a3
cites cdi_FETCH-LOGICAL-c319t-ac33e44a7296278ad56cef68483b536a67c219a3f33b3ad634c53ea5d677a73a3
container_end_page 4969
container_issue 9
container_start_page 4933
container_title Scientometrics
container_volume 128
creator Yao, Li
Ni, He
description Patents are valuable intellectual property only when granted by the governments, and failing to receive an official grant means disclosing valuable technologies and information, which otherwise would be kept as commercial secrets. Yet, a typical patent application process takes years to complete and the outcome is uncertain. This study implements machine learning models to predict patent examination outcomes based on early information disclosed at patent publication and interpret the mechanism of how these models make predictions, highlighting the key determinants to patent grant and delineating the relationships between the patent features and the examination outcome. The predictive models that integrate patent-level variables with textual information accomplish the best prediction performances with a 0.854 ROC-AUC score and 77% accuracy rate. A number of interpretable machine learning methods are applied. The permutation-based feature importance metric identifies key determinants such as applicants’ prior experience, page length, backward citation, claim counts, number of patent family, etc. SHAP (SHapley Additive exPlanations), a local interpretability method, describes the marginal contributions to the model prediction of key predictors using two actual patent examples. Our study provides several valuable findings with important theoretical insights and practical applications. Specifically, we show that patent-level information can serve as a predictor of examination outcomes and the relationships between the predictors and outcome variables are complex. Knowledge accumulation and technology complexity positively affect the likelihood of patent grants, albeit with a curvilinear relationship. At lower levels, both factors significantly increase the chance of a grant, but beyond a certain threshold, the marginal effect becomes less pronounced. Additionally, prior experience, patent family size, and engagement with the patent agency have a monotonic and positive relationship with the grant likelihood, whereas the impact of patent scope on patent grants remains uncertain. While a narrower and more specific patent claim is associated with a higher grant rate, the number of claims increases it. Moreover, technology range, inventor team size, and examination duration have little effect on the patent grant results. From a practical standpoint, the accurate prediction of patent grants has significant potential applications. For instance, it could help firms bet
doi_str_mv 10.1007/s11192-023-04736-z
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2848980967</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2848980967</sourcerecordid><originalsourceid>FETCH-LOGICAL-c319t-ac33e44a7296278ad56cef68483b536a67c219a3f33b3ad634c53ea5d677a73a3</originalsourceid><addsrcrecordid>eNp9kMtOwzAQRS0EEqXwA6wssQ7YmcR22CHES6oEC1hbU2fSBlon2Omi5edxKY8dmxlpdO4Z6TJ2KsW5FEJfRClllWcih0wUGlS22WMjWRqT5UbJfTYSEkxWSRCH7CjGV5FCIMyIfTwFqls3tJ3nXcN7HMgPfBYwTfQ1b_1AoQ80tH7GhznxN1rzmtJx2foExcuEcez7Revwx_IbwumC-BLdvPXEF4TBbzWJDl06HrODBheRTr73mL3c3jxf32eTx7uH66tJ5kBWQ4YOgIoCdV6pXBusS-WoUaYwMC1BodIulxVCAzAFrBUUrgTCslZaowaEMTvbedPb9xXFwb52q-DTS5snS2VEpXSi8h3lQhdjoMb2oV1iWFsp7LZkuyvZppLtV8l2k0KwC8UE-xmFP_U_qU98uoJ4</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2848980967</pqid></control><display><type>article</type><title>Prediction of patent grant and interpreting the key determinants: an application of interpretable machine learning approach</title><source>Library &amp; Information Science Abstracts (LISA)</source><source>Springer Nature</source><creator>Yao, Li ; Ni, He</creator><creatorcontrib>Yao, Li ; Ni, He</creatorcontrib><description>Patents are valuable intellectual property only when granted by the governments, and failing to receive an official grant means disclosing valuable technologies and information, which otherwise would be kept as commercial secrets. Yet, a typical patent application process takes years to complete and the outcome is uncertain. This study implements machine learning models to predict patent examination outcomes based on early information disclosed at patent publication and interpret the mechanism of how these models make predictions, highlighting the key determinants to patent grant and delineating the relationships between the patent features and the examination outcome. The predictive models that integrate patent-level variables with textual information accomplish the best prediction performances with a 0.854 ROC-AUC score and 77% accuracy rate. A number of interpretable machine learning methods are applied. The permutation-based feature importance metric identifies key determinants such as applicants’ prior experience, page length, backward citation, claim counts, number of patent family, etc. SHAP (SHapley Additive exPlanations), a local interpretability method, describes the marginal contributions to the model prediction of key predictors using two actual patent examples. Our study provides several valuable findings with important theoretical insights and practical applications. Specifically, we show that patent-level information can serve as a predictor of examination outcomes and the relationships between the predictors and outcome variables are complex. Knowledge accumulation and technology complexity positively affect the likelihood of patent grants, albeit with a curvilinear relationship. At lower levels, both factors significantly increase the chance of a grant, but beyond a certain threshold, the marginal effect becomes less pronounced. Additionally, prior experience, patent family size, and engagement with the patent agency have a monotonic and positive relationship with the grant likelihood, whereas the impact of patent scope on patent grants remains uncertain. While a narrower and more specific patent claim is associated with a higher grant rate, the number of claims increases it. Moreover, technology range, inventor team size, and examination duration have little effect on the patent grant results. From a practical standpoint, the accurate prediction of patent grants has significant potential applications. For instance, it could help firms better prioritize resources on the patent applications of high grant potentials to secure the final grant, as failure means a waste of R &amp;D effort and disclosure of technology without IP protection. Additionally, patent examiners could utilize our predictive results as prior knowledge to enhance their judgment and accelerate the examination process.</description><identifier>ISSN: 0138-9130</identifier><identifier>EISSN: 1588-2861</identifier><identifier>DOI: 10.1007/s11192-023-04736-z</identifier><language>eng</language><publisher>Cham: Springer International Publishing</publisher><subject>Complex variables ; Complexity ; Computer Science ; Family size ; Grants ; Information Storage and Retrieval ; Intellectual property ; Inventors ; Learning algorithms ; Library Science ; Machine learning ; Patent applications ; Patent searches ; Permutations ; Prediction models ; Team size ; Technology</subject><ispartof>Scientometrics, 2023-09, Vol.128 (9), p.4933-4969</ispartof><rights>Akadémiai Kiadó, Budapest, Hungary 2023. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c319t-ac33e44a7296278ad56cef68483b536a67c219a3f33b3ad634c53ea5d677a73a3</citedby><cites>FETCH-LOGICAL-c319t-ac33e44a7296278ad56cef68483b536a67c219a3f33b3ad634c53ea5d677a73a3</cites><orcidid>0000-0001-8054-4523</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902,34112</link.rule.ids></links><search><creatorcontrib>Yao, Li</creatorcontrib><creatorcontrib>Ni, He</creatorcontrib><title>Prediction of patent grant and interpreting the key determinants: an application of interpretable machine learning approach</title><title>Scientometrics</title><addtitle>Scientometrics</addtitle><description>Patents are valuable intellectual property only when granted by the governments, and failing to receive an official grant means disclosing valuable technologies and information, which otherwise would be kept as commercial secrets. Yet, a typical patent application process takes years to complete and the outcome is uncertain. This study implements machine learning models to predict patent examination outcomes based on early information disclosed at patent publication and interpret the mechanism of how these models make predictions, highlighting the key determinants to patent grant and delineating the relationships between the patent features and the examination outcome. The predictive models that integrate patent-level variables with textual information accomplish the best prediction performances with a 0.854 ROC-AUC score and 77% accuracy rate. A number of interpretable machine learning methods are applied. The permutation-based feature importance metric identifies key determinants such as applicants’ prior experience, page length, backward citation, claim counts, number of patent family, etc. SHAP (SHapley Additive exPlanations), a local interpretability method, describes the marginal contributions to the model prediction of key predictors using two actual patent examples. Our study provides several valuable findings with important theoretical insights and practical applications. Specifically, we show that patent-level information can serve as a predictor of examination outcomes and the relationships between the predictors and outcome variables are complex. Knowledge accumulation and technology complexity positively affect the likelihood of patent grants, albeit with a curvilinear relationship. At lower levels, both factors significantly increase the chance of a grant, but beyond a certain threshold, the marginal effect becomes less pronounced. Additionally, prior experience, patent family size, and engagement with the patent agency have a monotonic and positive relationship with the grant likelihood, whereas the impact of patent scope on patent grants remains uncertain. While a narrower and more specific patent claim is associated with a higher grant rate, the number of claims increases it. Moreover, technology range, inventor team size, and examination duration have little effect on the patent grant results. From a practical standpoint, the accurate prediction of patent grants has significant potential applications. For instance, it could help firms better prioritize resources on the patent applications of high grant potentials to secure the final grant, as failure means a waste of R &amp;D effort and disclosure of technology without IP protection. Additionally, patent examiners could utilize our predictive results as prior knowledge to enhance their judgment and accelerate the examination process.</description><subject>Complex variables</subject><subject>Complexity</subject><subject>Computer Science</subject><subject>Family size</subject><subject>Grants</subject><subject>Information Storage and Retrieval</subject><subject>Intellectual property</subject><subject>Inventors</subject><subject>Learning algorithms</subject><subject>Library Science</subject><subject>Machine learning</subject><subject>Patent applications</subject><subject>Patent searches</subject><subject>Permutations</subject><subject>Prediction models</subject><subject>Team size</subject><subject>Technology</subject><issn>0138-9130</issn><issn>1588-2861</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>F2A</sourceid><recordid>eNp9kMtOwzAQRS0EEqXwA6wssQ7YmcR22CHES6oEC1hbU2fSBlon2Omi5edxKY8dmxlpdO4Z6TJ2KsW5FEJfRClllWcih0wUGlS22WMjWRqT5UbJfTYSEkxWSRCH7CjGV5FCIMyIfTwFqls3tJ3nXcN7HMgPfBYwTfQ1b_1AoQ80tH7GhznxN1rzmtJx2foExcuEcez7Revwx_IbwumC-BLdvPXEF4TBbzWJDl06HrODBheRTr73mL3c3jxf32eTx7uH66tJ5kBWQ4YOgIoCdV6pXBusS-WoUaYwMC1BodIulxVCAzAFrBUUrgTCslZaowaEMTvbedPb9xXFwb52q-DTS5snS2VEpXSi8h3lQhdjoMb2oV1iWFsp7LZkuyvZppLtV8l2k0KwC8UE-xmFP_U_qU98uoJ4</recordid><startdate>20230901</startdate><enddate>20230901</enddate><creator>Yao, Li</creator><creator>Ni, He</creator><general>Springer International Publishing</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>E3H</scope><scope>F2A</scope><orcidid>https://orcid.org/0000-0001-8054-4523</orcidid></search><sort><creationdate>20230901</creationdate><title>Prediction of patent grant and interpreting the key determinants: an application of interpretable machine learning approach</title><author>Yao, Li ; Ni, He</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c319t-ac33e44a7296278ad56cef68483b536a67c219a3f33b3ad634c53ea5d677a73a3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Complex variables</topic><topic>Complexity</topic><topic>Computer Science</topic><topic>Family size</topic><topic>Grants</topic><topic>Information Storage and Retrieval</topic><topic>Intellectual property</topic><topic>Inventors</topic><topic>Learning algorithms</topic><topic>Library Science</topic><topic>Machine learning</topic><topic>Patent applications</topic><topic>Patent searches</topic><topic>Permutations</topic><topic>Prediction models</topic><topic>Team size</topic><topic>Technology</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Yao, Li</creatorcontrib><creatorcontrib>Ni, He</creatorcontrib><collection>CrossRef</collection><collection>Library &amp; Information Sciences Abstracts (LISA)</collection><collection>Library &amp; Information Science Abstracts (LISA)</collection><jtitle>Scientometrics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Yao, Li</au><au>Ni, He</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Prediction of patent grant and interpreting the key determinants: an application of interpretable machine learning approach</atitle><jtitle>Scientometrics</jtitle><stitle>Scientometrics</stitle><date>2023-09-01</date><risdate>2023</risdate><volume>128</volume><issue>9</issue><spage>4933</spage><epage>4969</epage><pages>4933-4969</pages><issn>0138-9130</issn><eissn>1588-2861</eissn><abstract>Patents are valuable intellectual property only when granted by the governments, and failing to receive an official grant means disclosing valuable technologies and information, which otherwise would be kept as commercial secrets. Yet, a typical patent application process takes years to complete and the outcome is uncertain. This study implements machine learning models to predict patent examination outcomes based on early information disclosed at patent publication and interpret the mechanism of how these models make predictions, highlighting the key determinants to patent grant and delineating the relationships between the patent features and the examination outcome. The predictive models that integrate patent-level variables with textual information accomplish the best prediction performances with a 0.854 ROC-AUC score and 77% accuracy rate. A number of interpretable machine learning methods are applied. The permutation-based feature importance metric identifies key determinants such as applicants’ prior experience, page length, backward citation, claim counts, number of patent family, etc. SHAP (SHapley Additive exPlanations), a local interpretability method, describes the marginal contributions to the model prediction of key predictors using two actual patent examples. Our study provides several valuable findings with important theoretical insights and practical applications. Specifically, we show that patent-level information can serve as a predictor of examination outcomes and the relationships between the predictors and outcome variables are complex. Knowledge accumulation and technology complexity positively affect the likelihood of patent grants, albeit with a curvilinear relationship. At lower levels, both factors significantly increase the chance of a grant, but beyond a certain threshold, the marginal effect becomes less pronounced. Additionally, prior experience, patent family size, and engagement with the patent agency have a monotonic and positive relationship with the grant likelihood, whereas the impact of patent scope on patent grants remains uncertain. While a narrower and more specific patent claim is associated with a higher grant rate, the number of claims increases it. Moreover, technology range, inventor team size, and examination duration have little effect on the patent grant results. From a practical standpoint, the accurate prediction of patent grants has significant potential applications. For instance, it could help firms better prioritize resources on the patent applications of high grant potentials to secure the final grant, as failure means a waste of R &amp;D effort and disclosure of technology without IP protection. Additionally, patent examiners could utilize our predictive results as prior knowledge to enhance their judgment and accelerate the examination process.</abstract><cop>Cham</cop><pub>Springer International Publishing</pub><doi>10.1007/s11192-023-04736-z</doi><tpages>37</tpages><orcidid>https://orcid.org/0000-0001-8054-4523</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0138-9130
ispartof Scientometrics, 2023-09, Vol.128 (9), p.4933-4969
issn 0138-9130
1588-2861
language eng
recordid cdi_proquest_journals_2848980967
source Library & Information Science Abstracts (LISA); Springer Nature
subjects Complex variables
Complexity
Computer Science
Family size
Grants
Information Storage and Retrieval
Intellectual property
Inventors
Learning algorithms
Library Science
Machine learning
Patent applications
Patent searches
Permutations
Prediction models
Team size
Technology
title Prediction of patent grant and interpreting the key determinants: an application of interpretable machine learning approach
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-13T02%3A21%3A34IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Prediction%20of%20patent%20grant%20and%20interpreting%20the%20key%20determinants:%20an%20application%20of%20interpretable%20machine%20learning%20approach&rft.jtitle=Scientometrics&rft.au=Yao,%20Li&rft.date=2023-09-01&rft.volume=128&rft.issue=9&rft.spage=4933&rft.epage=4969&rft.pages=4933-4969&rft.issn=0138-9130&rft.eissn=1588-2861&rft_id=info:doi/10.1007/s11192-023-04736-z&rft_dat=%3Cproquest_cross%3E2848980967%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c319t-ac33e44a7296278ad56cef68483b536a67c219a3f33b3ad634c53ea5d677a73a3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2848980967&rft_id=info:pmid/&rfr_iscdi=true