ERIC - Search Results

Showing 1 to 15 of 17 results

Automatic Short Answer Grading by Encoding Student Responses via a Graph Convolutional Network

Peer reviewed

Tan, Hongye; Wang, Chong; Duan, Qinglong; Lu, Yu; Zhang, Hu; Li, Ru – Interactive Learning Environments, 2023

Automatic short answer grading (ASAG) is a challenging task that aims to predict a score for a given student response. Previous works on ASAG mainly use nonneural or neural methods. However, the former depends on handcrafted features and is limited by its inflexibility and high cost, and the latter ignores global word cooccurrence in a corpus and…

Descriptors: Automation, Grading, Computer Assisted Testing, Graphs

IRT and MIRT Models for Item Parameter Estimation with Multidimensional Multistage Tests

Peer reviewed

Jewsbury, Paul A.; van Rijn, Peter W. – Journal of Educational and Behavioral Statistics, 2020

In large-scale educational assessment data consistent with a simple-structure multidimensional item response theory (MIRT) model, where every item measures only one latent variable, separate unidimensional item response theory (UIRT) models for each latent variable are often calibrated for practical reasons. While this approach can be valid for…

Descriptors: Item Response Theory, Computation, Test Items, Adaptive Testing

Comparison between Dichotomous and Polytomous Scoring of Innovative Items in a Large-Scale Computerized Adaptive Test

Peer reviewed

Jiao, Hong; Liu, Junhui; Haynie, Kathleen; Woo, Ada; Gorham, Jerry – Educational and Psychological Measurement, 2012

This study explored the impact of partial credit scoring of one type of innovative items (multiple-response items) in a computerized adaptive version of a large-scale licensure pretest and operational test settings. The impacts of partial credit scoring on the estimation of the ability parameters and classification decisions in operational test…

Descriptors: Test Items, Computer Assisted Testing, Measures (Individuals), Scoring

The "Phantom" Collapse of Student Achievement in New York: Lessons for Educators as States Implement the Common Core

Cronin, John; Jensen, Nate – Northwest Evaluation Association, 2014

On August 7th, 2013, the New York State Education Commissioner, John King, announced the initial results of the state's new assessment, which was designed to measure college and career readiness relative to the Common Core Learning Standards. Commissioner King noted that the proficiency rates on these assessments were significantly lower than…

Descriptors: Academic Achievement, Academic Standards, State Standards, College Readiness

Adult Science Learners' Mathematical Mistakes: An Analysis of Responses to Computer-Marked Questions

Peer reviewed

Jordan, Sally – European Journal of Science and Mathematics Education, 2014

Inspection of thousands of student responses to computer-marked assessment questions has brought insight into the errors made by adult distance learners of science. Most of the questions analysed were in summative use and required students to construct their own response. Both of these things increased confidence in the reliability of the…

Descriptors: Foreign Countries, Undergraduate Students, College Science, Science Education

WWC Review of the Report "Benefits of Practicing 4 = 2 + 2: Nontraditional Problem Formats Facilitate Children's Understanding of Mathematical Equivalence." What Works Clearinghouse Single Study Review

Peer reviewed

What Works Clearinghouse, 2014

The 2011 study, "Benefits of Practicing 4 = 2 + 2: Nontraditional Problem Formats Facilitate Children's Understanding of Mathematical Equivalence," examined the effects of addition practice using nontraditional problem formats on students' understanding of mathematical equivalence. In nontraditional problem formats, operations appear on…

Descriptors: Mathematics Instruction, Elementary School Students, Addition, Teaching Methods

An Empirical Evaluation of the Slip Correction in the Four Parameter Logistic Models with Computerized Adaptive Testing

Peer reviewed

Yen, Yung-Chin; Ho, Rong-Guey; Laio, Wen-Wei; Chen, Li-Ju; Kuo, Ching-Chin – Applied Psychological Measurement, 2012

In a selected response test, aberrant responses such as careless errors and lucky guesses might cause error in ability estimation because these responses do not actually reflect the knowledge that examinees possess. In a computerized adaptive test (CAT), these aberrant responses could further cause serious estimation error due to dynamic item…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Response Style (Tests)

Computerized Classification Testing under the Generalized Graded Unfolding Model

Peer reviewed

Wang, Wen-Chung; Liu, Chen-Wei – Educational and Psychological Measurement, 2011

The generalized graded unfolding model (GGUM) has been recently developed to describe item responses to Likert items (agree-disagree) in attitude measurement. In this study, the authors (a) developed two item selection methods in computerized classification testing under the GGUM, the current estimate/ability confidence interval method and the cut…

Descriptors: Computer Assisted Testing, Adaptive Testing, Classification, Item Response Theory

Online Calibration via Variable Length Computerized Adaptive Testing

Peer reviewed

Chang, Yuan-chin Ivan; Lu, Hung-Yi – Psychometrika, 2010

Item calibration is an essential issue in modern item response theory based psychological or educational testing. Due to the popularity of computerized adaptive testing, methods to efficiently calibrate new items have become more important than that in the time when paper and pencil test administration is the norm. There are many calibration…

Descriptors: Test Items, Educational Testing, Adaptive Testing, Measurement

A Monte Carlo Simulation Investigating the Validity and Reliability of Ability Estimation in Item Response Theory with Speeded Computer Adaptive Tests

Peer reviewed

Schmitt, T. A.; Sass, D. A.; Sullivan, J. R.; Walker, C. M. – International Journal of Testing, 2010

Imposed time limits on computer adaptive tests (CATs) can result in examinees having difficulty completing all items, thus compromising the validity and reliability of ability estimates. In this study, the effects of speededness were explored in a simulated CAT environment by varying examinee response patterns to end-of-test items. Expectedly,…

Descriptors: Monte Carlo Methods, Simulation, Computer Assisted Testing, Adaptive Testing

Variations on Stochastic Curtailment in Sequential Mastery Testing

Peer reviewed

Finkelman, Matthew David – Applied Psychological Measurement, 2010

In sequential mastery testing (SMT), assessment via computer is used to classify examinees into one of two mutually exclusive categories. Unlike paper-and-pencil tests, SMT has the capability to use variable-length stopping rules. One approach to shortening variable-length tests is stochastic curtailment, which halts examination if the probability…

Descriptors: Mastery Tests, Computer Assisted Testing, Adaptive Testing, Test Length

To Weight or Not to Weight? Balancing Influence of Initial Items in Adaptive Testing

Peer reviewed

Chang, Hua-Hua; Ying, Zhiliang – Psychometrika, 2008

It has been widely reported that in computerized adaptive testing some examinees may get much lower scores than they would normally if an alternative paper-and-pencil version were given. The main purpose of this investigation is to quantitatively reveal the cause for the underestimation phenomenon. The logistic models, including the 1PL, 2PL, and…

Descriptors: Adaptive Testing, Computer Assisted Testing, Computation, Test Items

A Review of "Integrity[TM]"

Peer reviewed

Veldkamp, Bernard P. – International Journal of Testing, 2008

Integrity[TM], an online application for testing both the statistical integrity of the test and the academic integrity of the examinees, was evaluated for this review. Program features and the program output are described. An overview of the statistics in Integrity[TM] is provided, and the application is illustrated with a small simulation study.…

Descriptors: Simulation, Integrity, Statistics, Computer Assisted Testing

Estimating the Standard Error of the Maximum Likelihood Ability Estimator in Adaptive Testing Using the Posterior-Weighted Test Information Function

Peer reviewed

Penfield, Randall D. – Educational and Psychological Measurement, 2007

The standard error of the maximum likelihood ability estimator is commonly estimated by evaluating the test information function at an examinee's current maximum likelihood estimate (a point estimate) of ability. Because the test information function evaluated at the point estimate may differ from the test information function evaluated at an…

Descriptors: Simulation, Adaptive Testing, Computation, Maximum Likelihood Statistics

Assessing Elementary Algebra with STACK

Peer reviewed

Sangwin, Christopher J. – International Journal of Mathematical Education in Science and Technology, 2007

This paper concerns computer aided assessment (CAA) of mathematics in which a computer algebra system (CAS) is used to help assess students' responses to elementary algebra questions. Using a methodology of documentary analysis, we examine what is taught in elementary algebra. The STACK CAA system, http://www.stack.bham.ac.uk/, which uses the CAS…

Descriptors: Arithmetic, Algebra, Computer Assisted Testing, Mathematics Instruction
