Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 10 |
Descriptor
Foreign Countries | 18 |
Test Items | 18 |
Computer Assisted Testing | 9 |
Testing Problems | 6 |
Adaptive Testing | 5 |
Item Response Theory | 5 |
Models | 5 |
Latent Trait Theory | 4 |
Statistical Analysis | 4 |
Test Format | 4 |
Adults | 3 |
More ▼ |
Source
Applied Psychological… | 2 |
Educational and Psychological… | 2 |
OECD Publishing | 2 |
ETS Research Report Series | 1 |
Geographical Education | 1 |
Grantee Submission | 1 |
Psicologica: International… | 1 |
Psychometrika | 1 |
Author
Kelderman, Henk | 3 |
Eggen, Theo J. H. M. | 2 |
Meijer, Rob R. | 2 |
Tendeiro, Jorge N. | 2 |
Bijsterbosch, Erik | 1 |
Egberink, Iris J. L. | 1 |
Eggen, T. J. H. M. | 1 |
He, Qiwei | 1 |
Hessen, David J. | 1 |
Hol, A. Michiel | 1 |
Jingchen Liu | 1 |
More ▼ |
Publication Type
Journal Articles | 9 |
Reports - Evaluative | 8 |
Reports - Research | 8 |
Speeches/Meeting Papers | 5 |
Collected Works - General | 1 |
Reports - Descriptive | 1 |
Reports - General | 1 |
Education Level
Higher Education | 2 |
Secondary Education | 2 |
Audience
Researchers | 1 |
Location
Netherlands | 18 |
France | 3 |
Germany | 3 |
Ireland | 3 |
Japan | 3 |
United States | 3 |
Australia | 2 |
Austria | 2 |
Belgium | 2 |
Denmark | 2 |
Estonia | 2 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
Program for the International… | 2 |
Program for International… | 1 |
What Works Clearinghouse Rating
OECD Publishing, 2019
Log files from computer-based assessment can help better understand respondents' behaviours and cognitive strategies. Analysis of timing information from Programme for the International Assessment of Adult Competencies (PIAAC) reveals large differences in the time participants take to answer assessment items, as well as large country differences…
Descriptors: Adults, Computer Assisted Testing, Test Items, Reaction Time
Susu Zhang; Xueying Tang; Qiwei He; Jingchen Liu; Zhiliang Ying – Grantee Submission, 2024
Computerized assessments and interactive simulation tasks are increasingly popular and afford the collection of process data, i.e., an examinee's sequence of actions (e.g., clickstreams, keystrokes) that arises from interactions with each task. Action sequence data contain rich information on the problem-solving process but are in a nonstandard,…
Descriptors: Correlation, Problem Solving, Computer Assisted Testing, Prediction
Bijsterbosch, Erik – Geographical Education, 2018
Geography teachers' school-based (internal) examinations in pre-vocational geography education in the Netherlands appear to be in line with the findings in the literature, namely that teachers' assessment practices tend to focus on the recall of knowledge. These practices are strongly influenced by national (external) examinations. This paper…
Descriptors: Foreign Countries, Instructional Effectiveness, National Competency Tests, Geography Instruction
Yamamoto, Kentaro; He, Qiwei; Shin, Hyo Jeong; von Davier, Mattias – ETS Research Report Series, 2017
Approximately a third of the Programme for International Student Assessment (PISA) items in the core domains (math, reading, and science) are constructed-response items and require human coding (scoring). This process is time-consuming, expensive, and prone to error as often (a) humans code inconsistently, and (b) coding reliability in…
Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students
Tendeiro, Jorge N.; Meijer, Rob R. – Applied Psychological Measurement, 2012
This article extends the work by Armstrong and Shi on CUmulative SUM (CUSUM) person-fit methodology. The authors present new theoretical considerations concerning the use of CUSUM person-fit statistics based on likelihood ratios for the purpose of detecting cheating and random guessing by individual test takers. According to the Neyman-Pearson…
Descriptors: Cheating, Individual Testing, Adaptive Testing, Statistics
Egberink, Iris J. L.; Meijer, Rob R.; Tendeiro, Jorge N. – Educational and Psychological Measurement, 2015
A popular method to assess measurement invariance of a particular item is based on likelihood ratio tests with all other items as anchor items. The results of this method are often only reported in terms of statistical significance, and researchers proposed different methods to empirically select anchor items. It is unclear, however, how many…
Descriptors: Personality Measures, Computer Assisted Testing, Measurement, Test Items
OECD Publishing, 2013
The Programme for the International Assessment of Adult Competencies (PIAAC) has been planned as an ongoing program of assessment. The first cycle of the assessment has involved two "rounds." The first round, which is covered by this report, took place over the period of January 2008-October 2013. The main features of the first cycle of…
Descriptors: International Assessment, Adults, Skills, Test Construction
Veldkamp, Bernard P.; Verschoor, Angela J.; Eggen, Theo J. H. M. – Psicologica: International Journal of Methodology and Experimental Psychology, 2010
Overexposure and underexposure of items in the bank are serious problems in operational computerized adaptive testing (CAT) systems. These exposure problems might result in item compromise, or point at a waste of investments. The exposure control problem can be viewed as a test assembly problem with multiple objectives. Information in the test has…
Descriptors: Adaptive Testing, Item Analysis, Computer Assisted Testing, Test Items
Hessen, David J. – Psychometrika, 2012
A multinormal partial credit model for factor analysis of polytomously scored items with ordered response categories is derived using an extension of the Dutch Identity (Holland in "Psychometrika" 55:5-18, 1990). In the model, latent variables are assumed to have a multivariate normal distribution conditional on unweighted sums of item…
Descriptors: Foreign Countries, Factor Analysis, Testing, Scoring
Hol, A. Michiel; Vorst, Harrie C. M.; Mellenbergh, Gideon J. – Applied Psychological Measurement, 2007
In a randomized experiment (n = 515), a computerized and a computerized adaptive test (CAT) are compared. The item pool consists of 24 polytomous motivation items. Although items are carefully selected, calibration data show that Samejima's graded response model did not fit the data optimally. A simulation study is done to assess possible…
Descriptors: Student Motivation, Simulation, Adaptive Testing, Computer Assisted Testing

Eggen, T. J. H. M.; Straetmans, G. J. J. M. – Educational and Psychological Measurement, 2000
Studied the use of adaptive testing when examinees are classified into three categories. Established testing algorithms with two different statistical computation procedures and evaluated them through simulation using an operative item bank from Dutch basic adult education. Results suggest a reduction of at least 22% in the mean number of items…
Descriptors: Adaptive Testing, Adult Education, Algorithms, Classification
Kelderman, Henk – 1986
A method is proposed for the detection of item bias with respect to observed or unobserved subgroups. The method uses quasi-loglinear models for the incomplete subgroup x test score x item 1 x ... x item k contingency table. If the subgroup membership is unknown, the models are the incomplete-latent-class models of S. J. Haberman (1979). The…
Descriptors: Foreign Countries, Higher Education, Latent Trait Theory, Mathematical Models
Kelderman, Henk; Macready, George B. – 1988
The use of loglinear latent class models to detect item bias was studied. Purposes of the study were to: (1) develop procedures for use in assessing item bias when the grouping variable with respect to which bias occurs is not observed; (2) develop bias detection procedures that relate to a conceptually different assessed trait--a categorical…
Descriptors: Foreign Countries, Higher Education, Latent Trait Theory, Mathematical Models

Kelderman, Henk – 1986
A method is proposed to equate different sets of items administered to different groups of individuals using the Rasch model. A Rasch equating model was formulated to describe one common Rasch scale in different groups with different but overlapping sets of items. The item parameters can then be estimated simultaneously, avoiding different…
Descriptors: Equated Scores, Equations (Mathematics), Estimation (Mathematics), Foreign Countries
Samson, Digna M. M. – 1983
The traditional multiple-choice reading comprehension test of English as a second language, used in the Dutch school-leaving examinations, has been criticized for its apparent lack of construct validity. The Dutch National Institute for Educational Measurement has conducted a number of studies to determine whether there is a different skill…
Descriptors: English (Second Language), Foreign Countries, Language Tests, Multiple Choice Tests
Previous Page | Next Page ยป
Pages: 1 | 2