ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	9

Descriptor

Computer Assisted Testing	31
Item Analysis	31
Test Reliability	31
Test Construction	14
Test Items	14
Test Validity	11
Adaptive Testing	10
Item Banks	9
Difficulty Level	7
Latent Trait Theory	7
Item Response Theory	6
Computer Programs	5
Higher Education	5
Scoring	5
Comparative Analysis	4
Evaluation Methods	4
Mathematical Models	4
Measurement Techniques	4
Models	4
Simulation	4
Test Interpretation	4
Test Length	4
Testing	4
Error of Measurement	3
Grading	3
More ▼

Source

Educational and Psychological…	2
Psychometrika	2
Applied Measurement in…	1
Applied Psychological…	1
Behavioral Research and…	1
CALICO Journal	1
Collegiate Microcomputer	1
Education	1
Educational Measurement:…	1
Journal of Computer-Based…	1
Journal of Education for…	1
Journal of Educational and…	1
Journal of Intelligence	1
Journal on Efficiency and…	1
Research Quarterly for…	1
Routledge, Taylor & Francis…	1
Turkish Online Journal of…	1
More ▼

Publication Type

Journal Articles	13
Reports - Research	13
Speeches/Meeting Papers	7
Reports - Descriptive	4
Reports - Evaluative	4
Books	2
Opinion Papers	2
Book/Product Reviews	1
Collected Works - General	1
Guides - Non-Classroom	1
Information Analyses	1
Numerical/Quantitative Data	1
Reference Materials -…	1
Tests/Questionnaires	1
More ▼

Education Level

Higher Education	2
Grade 1	1
Postsecondary Education	1
Secondary Education	1
Two Year Colleges	1

Audience

Researchers	4
Practitioners	3
Teachers	2

Location

Belgium	1
California	1
France	1
United States	1

Laws, Policies, & Programs

Elementary and Secondary…

Assessments and Surveys

Stanford Binet Intelligence…

What Works Clearinghouse Rating

Showing 1 to 15 of 31 results Save | Export

Development of Computer-Based Chemical Five-Tier Diagnostic Test Instruments: A Generalized Partial Credit Model

Peer reviewed
PDF on ERIC

Download full text

Achmad Rante Suparman; Eli Rohaeti; Sri Wening – Journal on Efficiency and Responsibility in Education and Science, 2024

This study focuses on developing a five-tier chemical diagnostic test based on a computer-based test with 11 assessment categories with an assessment score from 0 to 10. A total of 20 items produced were validated by education experts, material experts, measurement experts, and media experts, and an average index of the Aiken test > 0.70 was…

Descriptors: Chemistry, Diagnostic Tests, Computer Assisted Testing, Credits

Same Test, Better Scores: Boosting the Reliability of Short Online Intelligence Recruitment Tests with Nested Logit Item Response Theory Models

Peer reviewed
PDF on ERIC

Download full text

Storme, Martin; Myszkowski, Nils; Baron, Simon; Bernard, David – Journal of Intelligence, 2019

Assessing job applicants' general mental ability online poses psychometric challenges due to the necessity of having brief but accurate tests. Recent research (Myszkowski & Storme, 2018) suggests that recovering distractor information through Nested Logit Models (NLM; Suh & Bolt, 2010) increases the reliability of ability estimates in…

Descriptors: Intelligence Tests, Item Response Theory, Comparative Analysis, Test Reliability

Does MTV Really Do a Good Job of Evaluating Professors? An Empirical Test of the Internet Site Ratemyprofessors.com

Peer reviewed

Direct link

Murray, Keith B.; Zdravkovic, Srdan – Journal of Education for Business, 2016

Considerable debate continues regarding the efficacy of the website RateMyProfessors.com (RMP). To date, however, virtually no direct, experimental research has been reported which directly bears on questions relating to sampling adequacy or item adequacy in producing what favorable correlations have been reported. The authors compare the data…

Descriptors: Computer Assisted Testing, Computer Software Evaluation, Student Evaluation of Teacher Performance, Item Analysis

Multidimensional CAT Item Selection Methods for Domain Scores and Composite Scores: Theory and Applications

Peer reviewed

Direct link

Yao, Lihua – Psychometrika, 2012

Multidimensional computer adaptive testing (MCAT) can provide higher precision and reliability or reduce test length when compared with unidimensional CAT or with the paper-and-pencil test. This study compared five item selection procedures in the MCAT framework for both domain scores and overall scores through simulation by varying the structure…

Descriptors: Item Banks, Test Length, Simulation, Adaptive Testing

Sustainable Assessment and Evaluation Strategies for Open and Distance Learning

Peer reviewed
PDF on ERIC

Download full text

Okonkwo, Charity Akuadi – Turkish Online Journal of Distance Education, 2010

This paper first presents an overview of the concepts of assessment and evaluation in Open and Distance Learning (ODL) environment. The large numbers of students and numerous courses make assessment and evaluation very difficult and administrative nightmare at Distance Learning (DL) institutions. These challenges informed exploring issues relating…

Descriptors: Distance Education, Sustainability, Evaluation Methods, Educational Strategies

The Development of K-8 Progress Monitoring Measures in Mathematics for Use with the 2% and General Education Populations: Grade 1. Technical Report # 0919

Download full text

Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2009

In this technical report, we describe the development and piloting of a series of mathematics progress monitoring measures intended for use with students in grade 1. These measures, available as part of easyCBM [TM], an online progress monitoring assessment system, were developed in 2008 and administered to approximately 2800 students from schools…

Descriptors: Academic Achievement, Research Reports, Grade 1, Outcome Measures

A Theory of Consistency of Ordering Generalizable to Tailored Testing

Peer reviewed

Cliff, Norman – Psychometrika, 1977

Measures of consistency and completeness of order relationships derived from test data such as Guttman scales are proposed. The measures are generalized to apply to incomplete data such as data from tailored testing. (Author/JKS)

Descriptors: Adaptive Testing, Computer Assisted Testing, Computer Programs, Item Analysis

Reliability and Validity of the Flemish Physical Activity Computerized Questionnaire in Adults

Peer reviewed

Direct link

Matton, Lynn; Wijndaele, Katrien; Duvigneaud, Nathalie; Duquet, William; Philippaerts, Renaat; Thomis, Martine; Lefevre, Johan – Research Quarterly for Exercise and Sport, 2007

The purpose of this study was to investigate the test-retest reliability and concurrent validity of the Flemish Physical Activity Computerized Questionnaire (FPACQ) in employed/unemployed and retired people. The FPACQ was developed to assess detailed information on several dimensions of physical activity and sedentary behavior over a usual week. A…

Descriptors: Physical Activities, Physical Activity Level, Questionnaires, Item Analysis

Test Pac: A Program for Comprehensive Item and Reliability Analysis.

Peer reviewed

Luecht, Richard M. – Educational and Psychological Measurement, 1987

Test Pac, a test scoring and analysis computer program for moderate-sized sample designs using dichotomous response items, performs comprehensive item analyses and multiple reliability estimates. It also performs single-facet generalizability analysis of variance, single-parameter item response theory analyses, test score reporting, and computer…

Descriptors: Computer Assisted Testing, Computer Software, Computer Software Reviews, Item Analysis

A Sharing Item Response Theory Model for Computerized Adaptive Testing

Peer reviewed

Direct link

Segall, Daniel O. – Journal of Educational and Behavioral Statistics, 2004

A new sharing item response theory (SIRT) model is presented that explicitly models the effects of sharing item content between informants and test takers. This model is used to construct adaptive item selection and scoring rules that provide increased precision and reduced score gains in instances where sharing occurs. The adaptive item selection…

Descriptors: Scoring, Item Analysis, Item Response Theory, Adaptive Testing

An Investigation of the Differential Effort Received by Items on a Low-Stakes Computer-Based Test

Peer reviewed

Direct link

Wise, Steven L. – Applied Measurement in Education, 2006

In low-stakes testing, the motivation levels of examinees are often a matter of concern to test givers because a lack of examinee effort represents a direct threat to the validity of the test data. This study investigated the use of response time to assess the amount of examinee effort received by individual test items. In 2 studies, it was found…

Descriptors: Computer Assisted Testing, Motivation, Test Validity, Item Response Theory

A Tailored Testing Model Employing the Beta Distribution and Conditional Difficulties

Kalisch, Stanley J. – Journal of Computer-Based Instruction, 1974

A tailored testing model employing the beta distribution, whose mean equals the difficulty of an item and whose variance is approximately equal to the sampling variance of the item difficulty, and employing conditional item difficulties, is proposed. (Author)

Descriptors: Adaptive Testing, Computer Assisted Testing, Evaluation Methods, Item Analysis

TESTER: A Computer Program to Produce Individualized Multiple Choice Tests.

Peer reviewed

Hamer, Robert; Young, Forrest W. – Educational and Psychological Measurement, 1978

TESTER, a computer program which produces individualized objective tests from a pool of items, is described. Available in both PL/1 and FORTRAN, TESTER may be executed either interactively or in batch. (Author/JKS)

Descriptors: Adaptive Testing, Computer Assisted Testing, Computer Programs, Individualized Instruction

Testing Computer Assisted Language Testing: Towards a Checklist for CALT.

Peer reviewed

Noijons, Jose – CALICO Journal, 1994

Defines computer assisted language testing (CALT), discusses the various processes involved, outlines the advantages and disadvantages, and examines psychometric aspects of computer testing. A table of factors distinguishes between test content and the mechanics of test taking. These factors constitute a table for developing a CALT checklist. (24…

Descriptors: Check Lists, Computer Assisted Testing, Factor Analysis, Feedback

Predicting Item Difficulty in a Reading Comprehension Test with an Artificial Neural Network.

Download full text

Perkins, Kyle; And Others – 1994

This paper reports the results of using a three-layer backpropagation artificial neural network to predict item difficulty in a reading comprehension test. Two network structures were developed, one with and one without a sigmoid function in the output processing unit. The data set, which consisted of a table of coded test items and corresponding…

Descriptors: Artificial Intelligence, Computer Assisted Testing, Expert Systems, Item Analysis

Previous Page | Next Page »

Pages: 1 | 2 | 3

Cliff, Norman	2
Patience, Wayne M.	2
Reckase, Mark D.	2
Achmad Rante Suparman	1
Alonzo, Julie	1
Baron, Simon	1
Bernard, David	1
Cason, Gerald J.	1
Chase, Clinton I.	1
Cohen, Allan S., Comp.	1
Denison, D. Brian, Ed.	1
Duquet, William	1
Duvigneaud, Nathalie	1
Eli Rohaeti	1
Hamer, Robert	1
Hampilos, John P.	1
Harnisch, Delwyn L.	1
Jacobs, Lucy Cheser	1
Kalisch, Stanley J.	1
Kolstad, Rosemarie K.	1
Lefevre, Johan	1
Levitov, Justin E.	1
Lord, Frederic M.	1
Luecht, Richard M.	1
More ▼