Showing 1 to 15 of 30 results
Peer reviewed
Gardner, John; O'Leary, Michael; Yuan, Li – Journal of Computer Assisted Learning, 2021
Artificial Intelligence is at the heart of modern society with computers now capable of making process decisions in many spheres of human activity. In education, there has been intensive growth in systems that make formal and informal learning an anytime, anywhere activity for billions of people through online open educational resources and…
Descriptors: Artificial Intelligence, Educational Assessment, Formative Evaluation, Summative Evaluation
Partnership for Assessment of Readiness for College and Careers, 2016
The Partnership for Assessment of Readiness for College and Careers (PARCC) is a state-led consortium designed to create next-generation assessments that, compared to traditional K-12 assessments, more accurately measure student progress toward college and career readiness. The PARCC assessments are aligned to the Common Core State Standards…
Descriptors: Standardized Tests, Career Readiness, College Readiness, Test Validity
Peer reviewed
Lau, Paul Ngee Kiong; Lau, Sie Hoe; Hong, Kian Sam; Usop, Hasbee – Educational Technology & Society, 2011
The number right (NR) method, in which students pick one option as the answer, is the conventional method for scoring multiple-choice tests, but it is heavily criticized for encouraging students to guess and for failing to credit partial knowledge. In addition, computer technology is increasingly used in classroom assessment. This paper investigates the…
Descriptors: Guessing (Tests), Multiple Choice Tests, Computers, Scoring
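The number right method described above scores each item all-or-nothing, which is why it is criticized for ignoring partial knowledge. As a rough Python sketch only (the elimination-style partial-credit rule and its weights below are hypothetical illustrations, not the scoring schemes investigated by Lau et al.), the contrast looks like this:

```python
# Illustrative sketch: conventional number-right (NR) scoring versus a simple
# partial-credit scheme that rewards eliminating wrong options.

def number_right(responses, key):
    """1 point for each item whose single chosen option matches the key."""
    return sum(1 for chosen, correct in zip(responses, key) if chosen == correct)

def elimination_score(eliminations, key, n_options=4):
    """Credit each wrong option correctly eliminated; heavy penalty if the
    correct option is eliminated. Per-item score ranges from -(k-1) to k-1."""
    total = 0
    for ruled_out, correct in zip(eliminations, key):
        if correct in ruled_out:
            total -= (n_options - 1)   # eliminated the right answer
        else:
            total += len(ruled_out)    # partial knowledge credited
    return total

key = ["B", "D", "A"]
print(number_right(["B", "C", "A"], key))                       # 2
print(elimination_score([{"A", "C", "D"}, {"A"}, set()], key))  # 3 + 1 + 0 = 4
```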
Peer reviewed
Judd, Wallace – Practical Assessment, Research & Evaluation, 2009
Over the past twenty years, a specific item type with distinguishing characteristics has arisen time and time again in performance testing. It has been invented independently by dozens of test development teams. And yet this item type is not recognized in the research literature. This article is an invitation to investigate the item type, evaluate…
Descriptors: Test Items, Test Format, Evaluation, Item Analysis
Georgiadou, Elissavet; Triantafillou, Evangelos; Economides, Anastasios A. – Journal of Technology, Learning, and Assessment, 2007
Since researchers acknowledged the several advantages of computerized adaptive testing (CAT) over traditional linear test administration, the issue of item exposure control has received increased attention. Due to CAT's underlying philosophy, particular items in the item pool may be presented too often and become overexposed, while other items are…
Descriptors: Adaptive Testing, Computer Assisted Testing, Scoring, Test Items
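One widely cited way to limit overexposure of highly informative items (whether or not it is among the strategies reviewed in this article) is a randomesque rule: select at random from the k most informative eligible items rather than always taking the single best one. A minimal Python sketch, with invented 2PL item parameters:

```python
# Randomesque exposure control: draw from the k most informative remaining items
# so that no single item is administered on every test. Parameters are made up.
import math, random

def fisher_info(theta, a, b):
    """Fisher information of a 2PL item at ability theta."""
    p = 1.0 / (1.0 + math.exp(-a * (theta - b)))
    return a * a * p * (1.0 - p)

def select_item(theta, pool, administered, k=5):
    candidates = [i for i in pool if i["id"] not in administered]
    candidates.sort(key=lambda i: fisher_info(theta, i["a"], i["b"]), reverse=True)
    return random.choice(candidates[:k])

pool = [{"id": n, "a": random.uniform(0.8, 2.0), "b": random.uniform(-2, 2)}
        for n in range(200)]
print(select_item(theta=0.3, pool=pool, administered=set()))
```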
Wainer, Howard; Thissen, David – 1992
If examinees are permitted to choose to answer a subset of the questions on a test, just knowing which questions were chosen can provide a measure of proficiency that may be as reliable as one that would have been obtained by grading the test traditionally. This new method of scoring is much less time consuming and expensive for both the examinee and the…
Descriptors: Adaptive Testing, Cost Effectiveness, Responses, Scoring
Peer reviewed
Wang, LihShing; Li, Chun-Shan – Journal of Applied Measurement, 2001
Used Monte Carlo simulation to compare the relative measurement efficiency of polytomous modeling and dichotomous modeling under different scoring schemes and termination criteria. Results suggest that polytomous computerized adaptive testing (CAT) yields marginal gains over dichotomous CAT when termination criteria are more stringent. Discusses…
Descriptors: Adaptive Testing, Comparative Analysis, Computer Assisted Testing, Monte Carlo Methods
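Termination criteria of the kind varied in such simulations are typically framed as a target standard error of the ability estimate or a maximum test length. The sketch below shows a generic dichotomous-CAT stopping rule with an EAP ability update; it is a textbook-style illustration, not the authors' simulation design:

```python
# Generic CAT stopping rule: end the test once the posterior standard error of
# the ability estimate is small enough, or a maximum test length is reached.
import math

def p_correct(theta, a, b):
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def eap_update(responses, items, grid_step=0.1):
    """Expected a posteriori ability estimate and posterior SD on a theta grid."""
    grid = [-4 + grid_step * i for i in range(int(8 / grid_step) + 1)]
    post = []
    for t in grid:
        like = math.exp(-0.5 * t * t)        # standard normal prior
        for r, (a, b) in zip(responses, items):
            p = p_correct(t, a, b)
            like *= p if r else (1.0 - p)
        post.append(like)
    total = sum(post)
    mean = sum(t * w for t, w in zip(grid, post)) / total
    var = sum((t - mean) ** 2 * w for t, w in zip(grid, post)) / total
    return mean, math.sqrt(var)

def should_stop(se, n_items, se_target=0.3, max_items=30):
    return se <= se_target or n_items >= max_items

theta_hat, se = eap_update([1, 0, 1], [(1.2, -0.5), (0.9, 0.0), (1.4, 0.6)])
print(theta_hat, se, should_stop(se, n_items=3))
```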
Peer reviewed
Stocking, Martha L. – Journal of Educational and Behavioral Statistics, 1996
An alternative method for scoring adaptive tests, based on number-correct scores, is explored and compared with a method that relies more directly on item response theory. Using the number-correct score with necessary adjustment for intentional differences in adaptive test difficulty is a statistically viable scoring method. (SLD)
Descriptors: Adaptive Testing, Computer Assisted Testing, Difficulty Level, Item Response Theory
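A common way to make number-correct scores comparable across adaptive tests of intentionally different difficulty (not necessarily the adjustment Stocking evaluates) is to map each raw score through the test characteristic curve of the items the examinee actually received. A minimal sketch, assuming 2PL items:

```python
# Map a raw number-correct score to an ability estimate by inverting the test
# characteristic curve (TCC) of the administered items. Item parameters invented.
import math

def tcc(theta, items):
    """Expected number-correct score on these 2PL items at ability theta."""
    return sum(1.0 / (1.0 + math.exp(-a * (theta - b))) for a, b in items)

def theta_from_number_correct(raw_score, items, lo=-4.0, hi=4.0, tol=1e-4):
    """Bisection: find theta whose expected score equals the observed raw score."""
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if tcc(mid, items) < raw_score:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0

easy_form = [(1.2, -1.0), (1.0, -0.5), (0.9, -1.5), (1.1, 0.0)]
hard_form = [(1.2, 1.0), (1.0, 0.5), (0.9, 1.5), (1.1, 0.0)]
# The same raw score of 3 maps to different abilities on forms of different difficulty.
print(theta_from_number_correct(3, easy_form), theta_from_number_correct(3, hard_form))
```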
Peer reviewed
Thissen, David; And Others – Journal of Educational Measurement, 1989
An approach to scoring reading comprehension based on the concept of the testlet is described, using models developed for items in multiple categories. The model is illustrated using data from 3,866 examinees. Application of testlet scoring to multiple category models developed for individual items is discussed. (SLD)
Descriptors: Adaptive Testing, Computer Assisted Testing, Item Response Theory, Mathematical Models
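The testlet idea is to treat the set of items attached to one reading passage as a single unit, scored polytomously (for instance by the number correct within the passage) and modeled with a multiple-category IRT model rather than as independent items. A small illustration of the scoring step, with invented item groupings and responses:

```python
# Collapse item-level responses into passage-level polytomous "testlet scores".

def testlet_scores(responses, testlets):
    """responses: dict item_id -> 0/1; testlets: dict passage -> list of item_ids."""
    return {passage: sum(responses[i] for i in item_ids)
            for passage, item_ids in testlets.items()}

testlets = {"passage_1": ["q1", "q2", "q3", "q4"], "passage_2": ["q5", "q6", "q7"]}
responses = {"q1": 1, "q2": 0, "q3": 1, "q4": 1, "q5": 0, "q6": 1, "q7": 0}
print(testlet_scores(responses, testlets))  # {'passage_1': 3, 'passage_2': 1}
```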
Segall, Daniel O. – 1999
Two new methods for improving the measurement precision of a general test factor are proposed and evaluated. One new method provides a multidimensional item response theory estimate obtained from conventional administrations of multiple-choice test items that span general and nuisance dimensions. The other method chooses items adaptively to…
Descriptors: Ability, Adaptive Testing, Item Response Theory, Measurement Techniques
Peer reviewed
Rupp, Andre A. – International Journal of Testing, 2003
Item response theory (IRT) has become one of the most popular scoring frameworks for measurement data. IRT models are used frequently in computerized adaptive testing, cognitively diagnostic assessment, and test equating. This article reviews two of the most popular software packages for IRT model estimation, BILOG-MG (Zimowski, Muraki, Mislevy, &…
Descriptors: Test Items, Adaptive Testing, Item Response Theory, Computer Software
Stocking, Martha L. – 1994
Modern applications of computerized adaptive testing (CAT) are typically grounded in item response theory (IRT; Lord, 1980). While the IRT foundations of adaptive testing provide a number of approaches to adaptive test scoring that may seem natural and efficient to psychometricians, these approaches may be more demanding for test takers, test…
Descriptors: Adaptive Testing, Computer Assisted Testing, Difficulty Level, Equated Scores
Carlson, Sybil B.; Ward, William C. – 1988
Issues concerning the cost and feasibility of using Formulating Hypotheses (FH) test item types for the Graduate Record Examinations have slowed research into their use. This project focused on two major issues that need to be addressed in considering FH items for operational use: the costs of scoring and the assignment of scores along a range of…
Descriptors: Adaptive Testing, Computer Assisted Testing, Costs, Pilot Projects
Peer reviewed
Bennett, Randy Elliot; Steffen, Manfred; Singley, Mark Kevin; Morley, Mary; Jacquemin, Daniel – Journal of Educational Measurement, 1997
Scoring accuracy and item functioning were studied for an open-ended response type test in which correct answers can take many different surface forms. Results with 1,864 graduate school applicants showed automated scoring to approximate the accuracy of multiple-choice scoring. Items functioned similarly to other item types being considered. (SLD)
Descriptors: Adaptive Testing, Automation, College Applicants, Computer Assisted Testing
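Open-ended mathematical responses can be correct in many surface forms, which is the central difficulty for automated scoring. One generic approach (not necessarily the system studied here) is symbolic equivalence checking, sketched below with SymPy:

```python
# Score a free-form algebraic response as correct if it is symbolically
# equivalent to the key, i.e. their difference simplifies to zero.
import sympy

def equivalent(response: str, key: str) -> bool:
    try:
        diff = sympy.simplify(sympy.sympify(response) - sympy.sympify(key))
        return diff == 0
    except (sympy.SympifyError, TypeError):
        return False  # unparsable responses are scored as not equivalent

print(equivalent("2*(x + 1)", "2*x + 2"))     # True: different surface forms
print(equivalent("x**2 - 1", "(x-1)*(x+1)"))  # True
print(equivalent("2*x + 1", "2*x + 2"))       # False
```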
Potenza, Maria T.; Stocking, Martha L. – 1994
A multiple choice test item is identified as flawed if it has no single best answer. In spite of extensive quality control procedures, the administration of flawed items to test-takers is inevitable. Common strategies for dealing with flawed items in conventional testing, grounded in the principle of fairness to test-takers, are reexamined in the…
Descriptors: Adaptive Testing, Computer Assisted Testing, Multiple Choice Tests, Scoring