Achmad Rante Suparman; Eli Rohaeti; Sri Wening – Journal on Efficiency and Responsibility in Education and Science, 2024
This study focuses on developing a five-tier chemical diagnostic test based on a computer-based test with 11 assessment categories with an assessment score from 0 to 10. A total of 20 items produced were validated by education experts, material experts, measurement experts, and media experts, and an average index of the Aiken test > 0.70 was…
Descriptors: Chemistry, Diagnostic Tests, Computer Assisted Testing, Credits
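The "index of the Aiken test" in the abstract above presumably refers to Aiken's V, a content-validity coefficient computed from expert ratings. A minimal sketch of the standard formula follows; the 1-to-5 rating scale and the sample ratings are illustrative assumptions, not values from the study.

```python
def aiken_v(ratings, lowest, highest):
    """Aiken's V for one item: sum(r - lowest) / (n * (highest - lowest)).

    ratings: expert ratings for a single item.
    lowest, highest: endpoints of the rating scale (assumed here, e.g. 1 and 5).
    """
    n = len(ratings)
    if n == 0:
        raise ValueError("at least one rating is required")
    if highest <= lowest:
        raise ValueError("highest must exceed lowest")
    if any(r < lowest or r > highest for r in ratings):
        raise ValueError("rating outside the scale")
    return sum(r - lowest for r in ratings) / (n * (highest - lowest))


# Hypothetical ratings from four experts on a 1-5 scale.
example_v = aiken_v([4, 5, 4, 5], lowest=1, highest=5)
print(example_v)
```

V ranges from 0 (every expert gives the lowest rating) to 1 (every expert gives the highest), so a threshold such as 0.70 is read on that scale.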
Storme, Martin; Myszkowski, Nils; Baron, Simon; Bernard, David – Journal of Intelligence, 2019
Assessing job applicants' general mental ability online poses psychometric challenges due to the necessity of having brief but accurate tests. Recent research (Myszkowski & Storme, 2018) suggests that recovering distractor information through Nested Logit Models (NLM; Suh & Bolt, 2010) increases the reliability of ability estimates in…
Descriptors: Intelligence Tests, Item Response Theory, Comparative Analysis, Test Reliability
Murray, Keith B.; Zdravkovic, Srdan – Journal of Education for Business, 2016
Considerable debate continues regarding the efficacy of the website RateMyProfessors.com (RMP). To date, however, virtually no direct, experimental research has been reported which directly bears on questions relating to sampling adequacy or item adequacy in producing what favorable correlations have been reported. The authors compare the data…
Descriptors: Computer Assisted Testing, Computer Software Evaluation, Student Evaluation of Teacher Performance, Item Analysis
Yao, Lihua – Psychometrika, 2012
Multidimensional computer adaptive testing (MCAT) can provide higher precision and reliability or reduce test length when compared with unidimensional CAT or with the paper-and-pencil test. This study compared five item selection procedures in the MCAT framework for both domain scores and overall scores through simulation by varying the structure…
Descriptors: Item Banks, Test Length, Simulation, Adaptive Testing
Okonkwo, Charity Akuadi – Turkish Online Journal of Distance Education, 2010
This paper first presents an overview of the concepts of assessment and evaluation in the Open and Distance Learning (ODL) environment. The large numbers of students and numerous courses make assessment and evaluation very difficult and an administrative nightmare at Distance Learning (DL) institutions. These challenges informed exploring issues relating…
Descriptors: Distance Education, Sustainability, Evaluation Methods, Educational Strategies
Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2009
In this technical report, we describe the development and piloting of a series of mathematics progress monitoring measures intended for use with students in grade 1. These measures, available as part of easyCBM™, an online progress monitoring assessment system, were developed in 2008 and administered to approximately 2800 students from schools…
Descriptors: Academic Achievement, Research Reports, Grade 1, Outcome Measures

Cliff, Norman – Psychometrika, 1977
Measures of consistency and completeness of order relationships derived from test data such as Guttman scales are proposed. The measures are generalized to apply to incomplete data such as data from tailored testing. (Author/JKS)
Descriptors: Adaptive Testing, Computer Assisted Testing, Computer Programs, Item Analysis
Matton, Lynn; Wijndaele, Katrien; Duvigneaud, Nathalie; Duquet, William; Philippaerts, Renaat; Thomis, Martine; Lefevre, Johan – Research Quarterly for Exercise and Sport, 2007
The purpose of this study was to investigate the test-retest reliability and concurrent validity of the Flemish Physical Activity Computerized Questionnaire (FPACQ) in employed/unemployed and retired people. The FPACQ was developed to assess detailed information on several dimensions of physical activity and sedentary behavior over a usual week. A…
Descriptors: Physical Activities, Physical Activity Level, Questionnaires, Item Analysis

Luecht, Richard M. – Educational and Psychological Measurement, 1987
Test Pac, a test scoring and analysis computer program for moderate-sized sample designs using dichotomous response items, performs comprehensive item analyses and multiple reliability estimates. It also performs single-facet generalizability analysis of variance, single-parameter item response theory analyses, test score reporting, and computer…
Descriptors: Computer Assisted Testing, Computer Software, Computer Software Reviews, Item Analysis
Segall, Daniel O. – Journal of Educational and Behavioral Statistics, 2004
A new sharing item response theory (SIRT) model is presented that explicitly models the effects of sharing item content between informants and test takers. This model is used to construct adaptive item selection and scoring rules that provide increased precision and reduced score gains in instances where sharing occurs. The adaptive item selection…
Descriptors: Scoring, Item Analysis, Item Response Theory, Adaptive Testing
Wise, Steven L. – Applied Measurement in Education, 2006
In low-stakes testing, the motivation levels of examinees are often a matter of concern to test givers because a lack of examinee effort represents a direct threat to the validity of the test data. This study investigated the use of response time to assess the amount of examinee effort received by individual test items. In 2 studies, it was found…
Descriptors: Computer Assisted Testing, Motivation, Test Validity, Item Response Theory
Kalisch, Stanley J. – Journal of Computer-Based Instruction, 1974
A tailored testing model employing the beta distribution, whose mean equals the difficulty of an item and whose variance is approximately equal to the sampling variance of the item difficulty, and employing conditional item difficulties, is proposed. (Author)
Descriptors: Adaptive Testing, Computer Assisted Testing, Evaluation Methods, Item Analysis
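The beta distribution described in the abstract above can be pinned down by matching its mean and variance. The sketch below is an illustration only, not Kalisch's procedure: it assumes the sampling variance of an item difficulty p estimated from n examinees is the binomial value p(1 - p)/n, and the function name is hypothetical.

```python
def beta_from_difficulty(p, n):
    """Return (alpha, beta) of a beta distribution with mean p and
    variance p * (1 - p) / n (assumed binomial sampling variance).

    For Beta(a, b): mean = a / (a + b) and
    variance = mean * (1 - mean) / (a + b + 1).
    Setting the variance to p * (1 - p) / n gives a + b = n - 1.
    """
    if not 0.0 < p < 1.0:
        raise ValueError("p must lie strictly between 0 and 1")
    if n <= 1:
        raise ValueError("n must be greater than 1")
    total = n - 1  # a + b implied by the variance condition
    return p * total, (1.0 - p) * total


# Hypothetical item: 60% of 101 examinees answered correctly.
alpha, beta = beta_from_difficulty(0.6, 101)
print(alpha, beta)
```

Under these assumptions, a larger calibration sample yields a more concentrated distribution around the observed difficulty.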

Hamer, Robert; Young, Forrest W. – Educational and Psychological Measurement, 1978
TESTER, a computer program which produces individualized objective tests from a pool of items, is described. Available in both PL/1 and FORTRAN, TESTER may be executed either interactively or in batch. (Author/JKS)
Descriptors: Adaptive Testing, Computer Assisted Testing, Computer Programs, Individualized Instruction

Noijons, Jose – CALICO Journal, 1994
Defines computer assisted language testing (CALT), discusses the various processes involved, outlines the advantages and disadvantages, and examines psychometric aspects of computer testing. A table of factors distinguishes between test content and the mechanics of test taking. These factors constitute a table for developing a CALT checklist. (24…
Descriptors: Check Lists, Computer Assisted Testing, Factor Analysis, Feedback
Perkins, Kyle; And Others – 1994
This paper reports the results of using a three-layer backpropagation artificial neural network to predict item difficulty in a reading comprehension test. Two network structures were developed, one with and one without a sigmoid function in the output processing unit. The data set, which consisted of a table of coded test items and corresponding…
Descriptors: Artificial Intelligence, Computer Assisted Testing, Expert Systems, Item Analysis