Loading…

Enfocando la Sinonimia sobre Repositorios de Código Fuente

We expect that a huge corpus of code could be rich in patterns, and the corpus of software has statistical properties, which are similar to the corpus of natural language. In spite of code and text are similar, written code is a new problem domain for the techniques of natural language processing. T...

Full description

Saved in:
Bibliographic Details
Published in:RISTI : Revista Ibérica de Sistemas e Tecnologias de Informação 2020-07 (E31), p.573-589
Main Author: Del Carpio, Paul Mendoza
Format: Article
Language:Spanish
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:We expect that a huge corpus of code could be rich in patterns, and the corpus of software has statistical properties, which are similar to the corpus of natural language. In spite of code and text are similar, written code is a new problem domain for the techniques of natural language processing. Taking in account that synonymy is the base for building a WordNet, this work addresses the synonymy in source code repositories, presenting an approach and techniques based on naming patterns and term frequency for that. Keywords: naming patterns, term frequency, synonymy, word sense. 1.Introducción El uso del big data en diversos dominios de aplicación tiene un éxito abrumador (Allamanis et al., 2018; Raychev et al., 2019).
ISSN:1646-9895