Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 15 |
Descriptor
Computation | 17 |
Computer Assisted Testing | 17 |
Adaptive Testing | 11 |
Test Items | 9 |
Item Response Theory | 8 |
Equations (Mathematics) | 4 |
Simulation | 4 |
Classification | 3 |
Error of Measurement | 3 |
Models | 3 |
Ability | 2 |
More ▼ |
Source
Author
Chang, Hua-Hua | 1 |
Chang, Yuan-chin Ivan | 1 |
Chen, Li-Ju | 1 |
Cronin, John | 1 |
Duan, Qinglong | 1 |
Finkelman, Matthew David | 1 |
Gorham, Jerry | 1 |
Haynie, Kathleen | 1 |
Ho, Rong-Guey | 1 |
Jensen, Nate | 1 |
Jewsbury, Paul A. | 1 |
More ▼ |
Publication Type
Reports - Evaluative | 17 |
Journal Articles | 14 |
Education Level
Elementary Education | 3 |
Grade 4 | 2 |
Intermediate Grades | 2 |
Adult Education | 1 |
High Schools | 1 |
Higher Education | 1 |
Postsecondary Education | 1 |
Secondary Education | 1 |
Audience
Location
New York | 1 |
Taiwan | 1 |
United Kingdom | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Iowa Tests of Basic Skills | 1 |
National Assessment of… | 1 |
What Works Clearinghouse Rating
Tan, Hongye; Wang, Chong; Duan, Qinglong; Lu, Yu; Zhang, Hu; Li, Ru – Interactive Learning Environments, 2023
Automatic short answer grading (ASAG) is a challenging task that aims to predict a score for a given student response. Previous works on ASAG mainly use nonneural or neural methods. However, the former depends on handcrafted features and is limited by its inflexibility and high cost, and the latter ignores global word cooccurrence in a corpus and…
Descriptors: Automation, Grading, Computer Assisted Testing, Graphs
Jewsbury, Paul A.; van Rijn, Peter W. – Journal of Educational and Behavioral Statistics, 2020
In large-scale educational assessment data consistent with a simple-structure multidimensional item response theory (MIRT) model, where every item measures only one latent variable, separate unidimensional item response theory (UIRT) models for each latent variable are often calibrated for practical reasons. While this approach can be valid for…
Descriptors: Item Response Theory, Computation, Test Items, Adaptive Testing
Jiao, Hong; Liu, Junhui; Haynie, Kathleen; Woo, Ada; Gorham, Jerry – Educational and Psychological Measurement, 2012
This study explored the impact of partial credit scoring of one type of innovative items (multiple-response items) in a computerized adaptive version of a large-scale licensure pretest and operational test settings. The impacts of partial credit scoring on the estimation of the ability parameters and classification decisions in operational test…
Descriptors: Test Items, Computer Assisted Testing, Measures (Individuals), Scoring
Cronin, John; Jensen, Nate – Northwest Evaluation Association, 2014
On August 7th, 2013, the New York State Education Commissioner, John King, announced the initial results of the state's new assessment, which was designed to measure college and career readiness relative to the Common Core Learning Standards. Commissioner King noted that the proficiency rates on these assessments were significantly lower than…
Descriptors: Academic Achievement, Academic Standards, State Standards, College Readiness
Adult Science Learners' Mathematical Mistakes: An Analysis of Responses to Computer-Marked Questions
Jordan, Sally – European Journal of Science and Mathematics Education, 2014
Inspection of thousands of student responses to computer-marked assessment questions has brought insight into the errors made by adult distance learners of science. Most of the questions analysed were in summative use and required students to construct their own response. Both of these things increased confidence in the reliability of the…
Descriptors: Foreign Countries, Undergraduate Students, College Science, Science Education
What Works Clearinghouse, 2014
The 2011 study, "Benefits of Practicing 4 = 2 + 2: Nontraditional Problem Formats Facilitate Children's Understanding of Mathematical Equivalence," examined the effects of addition practice using nontraditional problem formats on students' understanding of mathematical equivalence. In nontraditional problem formats, operations appear on…
Descriptors: Mathematics Instruction, Elementary School Students, Addition, Teaching Methods
Yen, Yung-Chin; Ho, Rong-Guey; Laio, Wen-Wei; Chen, Li-Ju; Kuo, Ching-Chin – Applied Psychological Measurement, 2012
In a selected response test, aberrant responses such as careless errors and lucky guesses might cause error in ability estimation because these responses do not actually reflect the knowledge that examinees possess. In a computerized adaptive test (CAT), these aberrant responses could further cause serious estimation error due to dynamic item…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Response Style (Tests)
Wang, Wen-Chung; Liu, Chen-Wei – Educational and Psychological Measurement, 2011
The generalized graded unfolding model (GGUM) has been recently developed to describe item responses to Likert items (agree-disagree) in attitude measurement. In this study, the authors (a) developed two item selection methods in computerized classification testing under the GGUM, the current estimate/ability confidence interval method and the cut…
Descriptors: Computer Assisted Testing, Adaptive Testing, Classification, Item Response Theory
Chang, Yuan-chin Ivan; Lu, Hung-Yi – Psychometrika, 2010
Item calibration is an essential issue in modern item response theory based psychological or educational testing. Due to the popularity of computerized adaptive testing, methods to efficiently calibrate new items have become more important than that in the time when paper and pencil test administration is the norm. There are many calibration…
Descriptors: Test Items, Educational Testing, Adaptive Testing, Measurement
Schmitt, T. A.; Sass, D. A.; Sullivan, J. R.; Walker, C. M. – International Journal of Testing, 2010
Imposed time limits on computer adaptive tests (CATs) can result in examinees having difficulty completing all items, thus compromising the validity and reliability of ability estimates. In this study, the effects of speededness were explored in a simulated CAT environment by varying examinee response patterns to end-of-test items. Expectedly,…
Descriptors: Monte Carlo Methods, Simulation, Computer Assisted Testing, Adaptive Testing
Finkelman, Matthew David – Applied Psychological Measurement, 2010
In sequential mastery testing (SMT), assessment via computer is used to classify examinees into one of two mutually exclusive categories. Unlike paper-and-pencil tests, SMT has the capability to use variable-length stopping rules. One approach to shortening variable-length tests is stochastic curtailment, which halts examination if the probability…
Descriptors: Mastery Tests, Computer Assisted Testing, Adaptive Testing, Test Length
Chang, Hua-Hua; Ying, Zhiliang – Psychometrika, 2008
It has been widely reported that in computerized adaptive testing some examinees may get much lower scores than they would normally if an alternative paper-and-pencil version were given. The main purpose of this investigation is to quantitatively reveal the cause for the underestimation phenomenon. The logistic models, including the 1PL, 2PL, and…
Descriptors: Adaptive Testing, Computer Assisted Testing, Computation, Test Items
Veldkamp, Bernard P. – International Journal of Testing, 2008
Integrity[TM], an online application for testing both the statistical integrity of the test and the academic integrity of the examinees, was evaluated for this review. Program features and the program output are described. An overview of the statistics in Integrity[TM] is provided, and the application is illustrated with a small simulation study.…
Descriptors: Simulation, Integrity, Statistics, Computer Assisted Testing
Penfield, Randall D. – Educational and Psychological Measurement, 2007
The standard error of the maximum likelihood ability estimator is commonly estimated by evaluating the test information function at an examinee's current maximum likelihood estimate (a point estimate) of ability. Because the test information function evaluated at the point estimate may differ from the test information function evaluated at an…
Descriptors: Simulation, Adaptive Testing, Computation, Maximum Likelihood Statistics
Sangwin, Christopher J. – International Journal of Mathematical Education in Science and Technology, 2007
This paper concerns computer aided assessment (CAA) of mathematics in which a computer algebra system (CAS) is used to help assess students' responses to elementary algebra questions. Using a methodology of documentary analysis, we examine what is taught in elementary algebra. The STACK CAA system, http://www.stack.bham.ac.uk/, which uses the CAS…
Descriptors: Arithmetic, Algebra, Computer Assisted Testing, Mathematics Instruction
Previous Page | Next Page ยป
Pages: 1 | 2