22
ARTÍCULOS ORIGINALES
JOSUÉ PACHECO-ORTIZ, LISBETH RODRÍGUEZ-MAZAHUA, JEZREEL MEJÍA-MIRANDA, ISAAC MACHORRO-CANO,
GINER ALOR-HERNÁNDEZ, ULISES JUÁREZ-MARTÍNEZ
Revista Perspectiva Empresarial, Vol. 7, No. 2-1, julio-diciembre de 2020, 19-30
ISSN 2389-8186, E-ISSN 2389-8194
Introduction
Computer Adaptive Testing —CAT— (Chen,
Chao and Chen, 2019) has revolutionized the
traditional way of evaluating, since it dynamically
selects and manages the most appropriate questions
depending on the previous answers given by the
examinees. One of the central components of a CAT
is the item selection criterion (Miyazahua and Ueno,
2019), although the most widely used criterion
is Fisher’s Maximum Information (Albano et al.,
2019), it presents several weaknesses that generate
a certain degree of mistrust, for example, bias in
the item selection, estimation errors at the start
of the exam, or the same question being displayed
repeatedly to the tested one (Sheng, Bingwei and
Jiecheng, 2018; Du, Li and Chang, 2018; Lin and
Chang, 2019; Yigit, Sorrel and de la Torre, 2019;
Ye and Sun, 2018). Therefore, in this paper the
development of a CAT system that uses association
rules for the selection of items is proposed, focusing
on using the potential advantages of association
answered correctly or incorrectly and the questions
answered correctly, and thus present the most
appropriate questions (most likely to answer
correctly) in the tests, according to the responses
of the evaluated, considering the best rules (stored
in the database of students who submitted the
same test previously) with greater support and
Several research projects have used association
rule mining —ARM— with different algorithms in
their development, for example, in Rubio Delgado
et al. (2018), authors applied Apriori, FP-Growth,
PredictiveApriori and Tertius, grouping them
to compare them, so Apriori and FP-Growth were
values, whereas for PredictiveApriori and Tertius
time, the number of rules generated, the support
cases. In contrast, Wang et al. (2018) worked with
the Apriori algorithm, occupying for the comparison
whose set of generated rules were debugged based
on the minimum Lift, Chi-squared test and minimum
improvement. While in Prajapati, Garg and Chauhan
execution time and conviction were used in the
process of comparing the Distributed Frequent
Pattern Mining —DFMP—, Count Distribution
Algorithm —CDA— and Fast Distributed Mining
—FDM— algorithms.
The objective of this paper is to present a
comparative analysis of various ARM algorithms
that allows to select the most suitable for the
implementation in the CAT system that is
the introduction; (ii) background and some of the
works related to this research; (iii) the integration of
ARM in the CAT process and the comparison method
that was followed; (iv) the results and the analysis
Background and related works
Over the years, in different projects, various
tools have been applied in the development of the
parameter logistic model for item calibration (Lee
et al., 2018); maximum likelihood estimation for the
evaluator’s skill estimation (Albano et al., 2019);
and root mean square differences as an evaluation
criterion (Stafford et al., 2019), among others.
has been done to solve the problems presented
by Fisher’s Maximum Information, using other
selection strategies, for example, Bayesian networks
(Tokusada and Hirose, 2016), Greedy algorithm
(Bengs, Brefeld and Krohne, 2018), Kullback-Leibler
Information (Chen et al., 2017), Minimum Expected
Subsequent Variance (Rodríguez-Cuadrado et al.,
2020), to mention a few which, while they have
achieved favorable results, most have only been in
studies of simulation and not in real application.
We propose using ARM as an item selection
associations or correlations between the elements
or objects (in this case, test answers given by other
students in the past) of a database, it has many
can occur between correct/incorrect answers and
correct ones; (ii) they will determine the suitable