Loading…
Enfocando la Sinonimia sobre Repositorios de Código Fuente
We expect that a huge corpus of code could be rich in patterns, and the corpus of software has statistical properties, which are similar to the corpus of natural language. In spite of code and text are similar, written code is a new problem domain for the techniques of natural language processing. T...
Saved in:
Published in: | RISTI : Revista Ibérica de Sistemas e Tecnologias de Informação 2020-07 (E31), p.573-589 |
---|---|
Main Author: | |
Format: | Article |
Language: | Spanish |
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | We expect that a huge corpus of code could be rich in patterns, and the corpus of software has statistical properties, which are similar to the corpus of natural language. In spite of code and text are similar, written code is a new problem domain for the techniques of natural language processing. Taking in account that synonymy is the base for building a WordNet, this work addresses the synonymy in source code repositories, presenting an approach and techniques based on naming patterns and term frequency for that. Keywords: naming patterns, term frequency, synonymy, word sense. 1.Introducción El uso del big data en diversos dominios de aplicación tiene un éxito abrumador (Allamanis et al., 2018; Raychev et al., 2019). |
---|---|
ISSN: | 1646-9895 |