Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 5 |
Descriptor
Test Items | 15 |
Mathematical Models | 7 |
Models | 7 |
Item Analysis | 6 |
Test Construction | 6 |
Estimation (Mathematics) | 5 |
Latent Trait Theory | 5 |
Measurement Techniques | 5 |
Psychometrics | 5 |
Item Response Theory | 4 |
Verbal Tests | 4 |
More ▼ |
Source
College Board | 2 |
Behavioral Research and… | 1 |
ETS Research Report Series | 1 |
Grantee Submission | 1 |
Online Submission | 1 |
Author
Samejima, Fumiko | 2 |
Alonzo, Julie | 1 |
Anderson, Daniel | 1 |
Brennan, Robert L. | 1 |
David J. Weiss | 1 |
Eignor, Daniel R. | 1 |
Farish, Stephen J. | 1 |
Futagi, Yoko | 1 |
Gierl, Mark J. | 1 |
Gina Biancarosa | 1 |
Gokiert, Rebecca | 1 |
More ▼ |
Publication Type
Numerical/Quantitative Data | 15 |
Reports - Research | 9 |
Reports - Evaluative | 4 |
Books | 2 |
Guides - Non-Classroom | 1 |
Journal Articles | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Higher Education | 3 |
Elementary Education | 2 |
Postsecondary Education | 2 |
High Schools | 1 |
Kindergarten | 1 |
Middle Schools | 1 |
Secondary Education | 1 |
Audience
Researchers | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Medical College Admission Test | 2 |
SAT (College Admission Test) | 2 |
California Achievement Tests | 1 |
Graduate Record Examinations | 1 |
Iowa Tests of Basic Skills | 1 |
National Assessment of… | 1 |
What Works Clearinghouse Rating
Mark L. Davison; David J. Weiss; Ozge Ersan; Joseph N. DeWeese; Gina Biancarosa; Patrick C. Kennedy – Grantee Submission, 2021
MOCCA is an online assessment of inferential reading comprehension for students in 3rd through 6th grades. It can be used to identify good readers and, for struggling readers, identify those who overly rely on either a Paraphrasing process or an Elaborating process when their comprehension is incorrect. Here a propensity to over-rely on…
Descriptors: Reading Tests, Computer Assisted Testing, Reading Comprehension, Elementary School Students
Lee, Eunjung; Lee, Won-Chan; Brennan, Robert L. – College Board, 2012
In almost all high-stakes testing programs, test equating is necessary to ensure that test scores across multiple test administrations are equivalent and can be used interchangeably. Test equating becomes even more challenging in mixed-format tests, such as Advanced Placement Program® (AP®) Exams, that contain both multiple-choice and constructed…
Descriptors: Test Construction, Test Interpretation, Test Norms, Test Reliability
Nese, Joseph F. T.; Lai, Cheng-Fei; Anderson, Daniel; Park, Bitnara Jasmine; Tindal, Gerald; Alonzo, Julie – Behavioral Research and Teaching, 2010
The purpose of this study was to examine the alignment of the easyCBM[R] mathematics benchmark and progress monitoring measures to the National Council of Teachers of Mathematics "Curriculum Focal Points" (NCTM, 2006). Based on Webb's alignment model (1997, 2002), we collected expert judgments on individual math items across a sampling of forms…
Descriptors: Academic Standards, Mathematics Teachers, Benchmarking, Research Reports
Gierl, Mark J.; Leighton, Jacqueline P.; Wang, Changjiang; Zhou, Jiawen; Gokiert, Rebecca; Tan, Adele – College Board, 2009
The purpose of the study is to present research focused on validating the four algebra cognitive models in Gierl, Wang, et al., using student response data collected with protocol analysis methods to evaluate the knowledge structures and processing skills used by a sample of SAT test takers.
Descriptors: Algebra, Mathematics Tests, College Entrance Examinations, Student Attitudes
Hendrickson, Amy B.; Kolen, Michael J. – 2001
This study compared various equating models and procedures for a sample of data from the Medical College Admission Test(MCAT), considering how item response theory (IRT) equating results compare with classical equipercentile results and how the results based on use of various IRT models, observed score versus true score, direct versus linked…
Descriptors: Equated Scores, Higher Education, Item Response Theory, Models
Wright, Benjamin D.; Stone, Mark H. – 1979
This handbook explains how to do Rasch measurement. The emphasis is on practice, but theoretical explanations are also provided. The Forward contains an introduction to the topic of Rasch measurement. Chapters 2, 4, 5, and 7 use a small problem to illustrate the application of Rasch measurement in detail, and methodological issues are considered…
Descriptors: Item Response Theory, Mathematical Models, Measurement Techniques, Psychometrics
Rudner, Lawrence M.; And Others – 1995
Fit statistics provide a direct measure of assessment accuracy by analyzing the fit of measurement models to an individual's (or group's) response pattern. Students that lose interest during the assessment, for example, will miss exercises that are within their abilities. Such students will respond correctly to some more difficult items and…
Descriptors: Difficulty Level, Educational Assessment, Goodness of Fit, Measurement Techniques
Tatsuoka, Kikumi K. – 1988
When learning is taking place, students test their hypotheses and evaluate them, and modify their current theories on the basis of new information. This phenomenon is known as "hypothesis testing view" or "theory changes." Many students change their rules to another while they are taking a test. This study introduced a new…
Descriptors: Cognitive Processes, Elementary Education, Equations (Mathematics), Hypothesis Testing
Samejima, Fumiko – 1984
In order to evaluate our methods and approaches of estimating the operating characteristics of discrete item responses, it is necessary to try other comparable methods on similar sets of data. LOGIST 5 was taken up for this reason, and was tried upon the hypothetical test items, which follow the normal ogive model and were used frequently in…
Descriptors: Computer Simulation, Computer Software, Estimation (Mathematics), Item Analysis
Sheehan, Kathleen M.; Kostin, Irene; Futagi, Yoko; Hemat, Ramin; Zuckerman, Daniel – ETS Research Report Series, 2006
This paper describes the development, implementation, and evaluation of an automated system for predicting the acceptability status of candidate reading-comprehension stimuli extracted from a database of journal and magazine articles. The system uses a combination of classification and regression techniques to predict the probability that a given…
Descriptors: Automation, Prediction, Reading Comprehension, Classification
Samejima, Fumiko – 1984
Simple sum procedure of the conditional PDF approach (plausiblity of distractor function) combined with the normal approach method was applied for estimating the plausibility functions of the distractors of the Level II vocabulary subtest items of the Iowa Tests of Basic Skills. In so doing, the normal ogive model was adopted for the correct…
Descriptors: Adaptive Testing, Elementary Secondary Education, Estimation (Mathematics), Item Analysis
Rasch, Georg – 1993
The psychometric research done by G. Rasch between 1951 and 1959, which is explained and illustrated in this book, takes psychometrics from being purely descriptive to being a science of objective measurement. Individual centered statistics require models in which each individual is characterized separately and from which, given adequate data,…
Descriptors: Achievement Tests, Estimation (Mathematics), Intelligence Tests, Item Response Theory
Rizavi, Saba; Way, Walter D.; Lu, Ying; Pitoniak, Mary; Steffen, Manfred – Online Submission, 2004
The purpose of this study was to use realistically simulated data to evaluate various CAT designs for use with the verbal reasoning measure of the Medical College Admissions Test (MCAT). Factors such as item pool depth, content constraints, and item formats often cause repeated adaptive administrations of an item at ability levels that are not…
Descriptors: Test Items, Test Bias, Item Banks, College Admission
Eignor, Daniel R. – 1985
The feasibility of pre-equating, or establishing conversions from raw to scaled scores through the use of pretest data before operationally administering a test, was investigated for the Scholastic Aptitude Test (SAT). Item-response theory based equating methods were used to estimate item parameters on SAT pretest data, instead of using final form…
Descriptors: College Entrance Examinations, Equated Scores, Estimation (Mathematics), Feasibility Studies
Farish, Stephen J. – 1984
The stability of Rasch test item difficulty parameters was investigated under varying conditions. Data were taken from a mathematics achievement test administered to over 2,000 Australian students. The experiments included: (1) relative stability of the Rasch, traditional, and z-item difficulty parameters using different sample sizes and designs;…
Descriptors: Achievement Tests, Difficulty Level, Estimation (Mathematics), Foreign Countries