Loading…

DETECCIÓN ERRÓNEA DEL FUNCIONAMIENTO DIFERENCIAL DEL ÍTEM.UNA COMPARACIÓN DE MÉTODOS

Se comparan distintos métodos para analizar el Funcionamiento Diferencial del Ítem (DIF), en términos de sus proporciones de falsos positivos. Éstos son: la prueba normal para la diferencia de los parámetros de dificultad (PN), la prueba χ2 de Mantel-Haenszel y la clasificación en ‘tipo C’ o no segú...

Full description

Saved in:

Bibliographic Details
Published in:	Psicothema 2005-01, Vol.17 (2), p.350
Main Authors:	Aguerri, María Ester, Galibert, María Silvia, Zanelli, Marta L, Attorresi, Horacio F
Format:	Article
Language:	eng ; spa
Subjects:	Chi-square test Educational evaluation Statistical analysis
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	Se comparan distintos métodos para analizar el Funcionamiento Diferencial del Ítem (DIF), en términos de sus proporciones de falsos positivos. Éstos son: la prueba normal para la diferencia de los parámetros de dificultad (PN), la prueba χ2 de Mantel-Haenszel y la clasificación en ‘tipo C’ o no según la sola magnitud del estadístico MH D-DIF (clasificación empírica) —estos últimos de manera estándar y bietápica— y según el criterio del ETS. Se aplicaron a datos simulados sin DIF bajo distintas condiciones de diseño según tamaño de muestra, impacto y configuraciones de los parámetros de los ítemes. Los procedimientos bietápicos condujeron a resultados similares a los estándar en presencia de impacto. El método que generalmente arroja menores proporciones de falsos positivos es la PN; le siguen ambos tipos de clasificación las cuales, teniendo el mismo costo computacional que la prueba de Mantel-Haenszel, fallan mucho menos en la detección errónea del DIF. Erroneous detection of Differential Item Functioning. A comparison of methods. Some methods to analyze Differential Item Functioning (DIF) are compared in terms of their false positive rates, namely the normal test (NT) for the difference of difficulty parameters, the Mantel-Haenszel chi-square test and the item classification in ‘Type C’ or ‘Not Type C’ according to the only magnitude of the MH D-DIF statistic (empirical classification) -the last two on both a single-stage and two-stage procedures- and according to ETS criterion. They were applied to data without DIF which were simulated under different design conditions with respect to sample size, impact and item parameter configurations. The results obtained from two-stage and single-stage procedures were similar when impact is present. The method that generally produces lower false positive rates is the NT one, followed by both types of classification which, holding the same computational cost as Mantel-Haenszel test, have a much lower failure rate in the erroneous detection of DIF.
ISSN:	0214-9915 1886-144X