Showing 571 to 585 of 1,161 results
Peer reviewed
Stevens, Joseph J.; Aleamoni, Lawrence M. – Educational and Psychological Measurement, 1986
Prior standardization of scores when an aggregate score is formed has been criticized. This article presents a demonstration of the effects of differential weighting of aggregate components that clarifies the need for prior standardization. The role of standardization in statistics and the use of aggregate scores in research are discussed.…
Descriptors: Correlation, Error of Measurement, Factor Analysis, Raw Scores
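The weighting issue Stevens and Aleamoni demonstrate can be seen in a small simulation. The sketch below is illustrative only (the data and weights are invented, not the article's): with raw scores, the component with the larger standard deviation dominates the aggregate regardless of the nominal weights, whereas standardizing the components first makes the effective weighting match the intended one.

```python
# Illustrative sketch (not from the article): how component scale differences
# distort nominal weights in an aggregate, and how prior standardization fixes it.
import numpy as np

rng = np.random.default_rng(0)
n = 1000

# Two components with very different spreads (e.g., a short quiz and a long exam).
quiz = rng.normal(50, 2, n)     # small standard deviation
exam = rng.normal(100, 30, n)   # large standard deviation

weights = (0.5, 0.5)            # nominal equal weighting

# Raw-score aggregate: the high-variance component dominates despite equal weights.
raw_total = weights[0] * quiz + weights[1] * exam

# Standardize first, then weight: each component now contributes as intended.
z = lambda x: (x - x.mean()) / x.std()
std_total = weights[0] * z(quiz) + weights[1] * z(exam)

# Correlation of each component with the aggregate reveals the effective weighting.
for name, comp in [("quiz", quiz), ("exam", exam)]:
    print(name,
          "raw r = %.2f" % np.corrcoef(comp, raw_total)[0, 1],
          "standardized r = %.2f" % np.corrcoef(comp, std_total)[0, 1])
```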
Peer reviewed
Andersen, Erling B. – Psychometrika, 1985
A model for longitudinal latent structure analysis is proposed that combines the values of a latent variable at two time points in a two-dimensional latent density. The correlation coefficient between the two values of the latent variable can then be estimated. (NSF)
Descriptors: Correlation, Latent Trait Theory, Mathematical Models, Maximum Likelihood Statistics
Peer reviewed
Purves, Alan C. – Language Arts, 1986
Explores the three broad thrusts of literature curricula, noting that no test can cover all three. Discusses how the content and objectives of the literature curriculum can be specified and how test questions can be developed for the evaluation of literature comprehension. (HTH)
Descriptors: Curriculum Problems, Elementary Education, Language Arts, Literature Appreciation
Peer reviewed
Yen, Wendy M. – Psychometrika, 1983
Tau-equivalence means that two tests produce equal true scores for individuals but that the distribution of errors for the tests could be different. This paper examines the effect of performing equipercentile equating techniques on tau-equivalent tests. (JKS)
Descriptors: Equated Scores, Latent Trait Theory, Psychometrics, Scores
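As an illustration of the situation Yen examines, the following sketch (assumptions and numbers are mine, not the article's) simulates two tau-equivalent tests, identical true scores but different error variances, and applies a simple equipercentile mapping; because the observed-score distributions differ, the equated score is not simply the raw score.

```python
# Illustrative sketch: tau-equivalent tests share true scores but differ in error
# variance, so an equipercentile function mapping one to the other is not identity.
import numpy as np

rng = np.random.default_rng(1)
n = 5000

true = rng.normal(50, 10, n)          # common true scores (tau-equivalence)
test_x = true + rng.normal(0, 3, n)   # small error variance
test_y = true + rng.normal(0, 8, n)   # larger error variance

def equipercentile(x_scores, y_scores, x_new):
    """Map x_new to the y score with the same percentile rank."""
    pct = (np.searchsorted(np.sort(x_scores), x_new) / len(x_scores)) * 100
    return np.percentile(y_scores, pct)

# The equated scores are pulled toward the tails because test Y has a wider
# observed-score distribution, even though the true scores are identical.
for x in (35.0, 50.0, 65.0):
    print(f"X = {x:5.1f}  ->  equated Y = {equipercentile(test_x, test_y, x):5.1f}")
```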
Peer reviewed
Bieliauskas, Vytautas J.; Farragher, John – Journal of Clinical Psychology, 1983
Administered the House-Tree-Person test to male college students (N=24) to examine the effects of varying the size of the drawing form on the scores. Results suggested that use of the drawing sheet did not have a significant influence upon the quantitative aspects of the drawing. (LLL)
Descriptors: College Students, Higher Education, Intelligence Tests, Males
Peer reviewed
Masters, Geoffrey N. – Educational and Psychological Measurement, 1984
DICOT, a computer program for the Rasch analysis of classroom tests, is described. Results are presented in a self-explanatory form. Person ability and item difficulty estimates are expressed in a familiar metric. Person and item fit statistics provide a diagnosis of individual children and identification of problematic items. (Author/DWH)
Descriptors: Classroom Techniques, Foreign Countries, Item Analysis, Latent Trait Theory
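For context, the dichotomous Rasch model underlying programs such as DICOT can be sketched as follows. This is not DICOT's code; the probability function is the standard Rasch form, and the log-odds start values are only a crude first approximation to the person ability and item difficulty estimates such a program would refine.

```python
# Sketch of the dichotomous Rasch model with crude logit starting estimates.
import numpy as np

def rasch_prob(theta, b):
    """Probability of a correct response for ability theta and item difficulty b (logits)."""
    return 1.0 / (1.0 + np.exp(-(theta - b)))

def logit_starts(responses):
    """Rough start values from a persons x items 0/1 matrix: log-odds of raw-score proportions."""
    p_person = responses.mean(axis=1).clip(0.01, 0.99)
    p_item = responses.mean(axis=0).clip(0.01, 0.99)
    ability = np.log(p_person / (1 - p_person))
    difficulty = -np.log(p_item / (1 - p_item))
    difficulty -= difficulty.mean()          # centre difficulties at 0, the usual convention
    return ability, difficulty

rng = np.random.default_rng(2)
true_theta = rng.normal(0, 1, 200)
true_b = np.linspace(-2, 2, 10)
data = (rng.random((200, 10)) < rasch_prob(true_theta[:, None], true_b[None, :])).astype(int)

ability, difficulty = logit_starts(data)
print("estimated item difficulties:", np.round(difficulty, 2))
```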
Peer reviewed
Stegelmann, Werner – Psychometrika, 1983
The Rasch model is generalized to a multicomponent model, so that observations of component events are not needed to apply the model. It is shown that the generalized model maintains the property of specific objectivity of the Rasch model. An application to a mathematics test is provided. (Author/JKS)
Descriptors: Estimation (Mathematics), Item Analysis, Latent Trait Theory, Mathematical Models
Peer reviewed
Williams, Richard H.; Zimmerman, Donald W. – Journal of Experimental Education, 1982
The reliability of simple difference scores is greater than, less than, or equal to that of residualized difference scores, depending on whether the correlation between pretest and posttest scores is greater than, less than, or equal to the ratio of the standard deviations of pretest and posttest scores. (Author)
Descriptors: Achievement Gains, Comparative Analysis, Correlation, Pretests Posttests
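The comparison Williams and Zimmerman analyze can be checked numerically. The simulation below is a sketch under my own assumptions (simulated true scores treated as known, and reliability operationalized as true-score variance over observed-score variance); it simply reports both reliabilities alongside the pretest-posttest correlation and the ratio of standard deviations so the stated relationship can be inspected.

```python
# Sketch: reliability of simple (Y - X) versus residualized (Y - bX) difference scores,
# computed directly from simulated true and observed pretest/posttest scores.
import numpy as np

rng = np.random.default_rng(3)
n = 100_000

# True pretest and posttest scores, correlated, with unequal spreads.
t_pre = rng.normal(0, 1.0, n)
t_post = 0.7 * t_pre + rng.normal(0, 0.9, n)

x = t_pre + rng.normal(0, 0.5, n)       # observed pretest
y = t_post + rng.normal(0, 0.5, n)      # observed posttest

cxy = np.cov(x, y)
b = cxy[0, 1] / cxy[0, 0]               # regression slope of Y on X

def reliability(true_part, observed_part):
    return np.var(true_part) / np.var(observed_part)

rel_simple = reliability(t_post - t_pre, y - x)
rel_resid = reliability(t_post - b * t_pre, y - b * x)

print(f"r(X, Y) = {np.corrcoef(x, y)[0, 1]:.3f},  SD(X)/SD(Y) = {x.std() / y.std():.3f}")
print(f"reliability of simple difference      = {rel_simple:.3f}")
print(f"reliability of residualized difference = {rel_resid:.3f}")
```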
Peer reviewed
Gravett, Sarah – South African Journal of Higher Education, 1996
Argues that student assessment plays a crucial role in the academic life of college students and that assessment arrangements embody the purposes of higher education. Reviews research suggesting that learners' perceptions of course testing procedures are the single most important influence on learning. Outlines six guiding principles of test development to…
Descriptors: College Instruction, Educational Objectives, Higher Education, Student Evaluation
Peer reviewed
Mislevy, Robert J. – Journal of Educational Measurement, 1996
Developments in cognitive and developmental psychology have broadened the inferences researchers want to make about students' learning and the nature and acquisition of knowledge. The principles of inference that led to standard test theory can support inference in the broader context of the cognitive revolution. (SLD)
Descriptors: Cognitive Psychology, Developmental Psychology, Educational Assessment, Educational Research
Peer reviewed
Klinger, Don A.; Rogers, W. Todd – Alberta Journal of Educational Research, 2003
The estimation accuracy of procedures based on classical test score theory and item response theory (generalized partial credit model) was compared for examinations consisting of multiple-choice and extended-response items. Analysis of British Columbia Scholarship Examination results found an error rate of about 10 percent for both methods, with…
Descriptors: Academic Achievement, Educational Testing, Foreign Countries, High Stakes Tests
Peer reviewed
Andrich, David – Psychometrika, 1995
This book discusses adapting pencil-and-paper tests to computerized testing. Mention is made of models for graded responses to items and of possibilities beyond pencil-and-paper-tests, but the book is essentially about dichotomously scored test items. Contrasts between item response theory and classical test theory are described. (SLD)
Descriptors: Adaptive Testing, Computer Assisted Testing, Item Response Theory, Scores
Peer reviewed
Little, Roderick J. A.; Rubin, Donald B. – Journal of Educational and Behavioral Statistics, 1994
Equating a new standard test to an old reference test is considered when the samples used for equating are not randomly selected from the target population of test takers; two problems that arise when equating from biased samples are identified. An empirical example with data from the Armed Services Vocational Aptitude Battery illustrates the approach. (SLD)
Descriptors: Equated Scores, Military Personnel, Sampling, Statistical Analysis
Peer reviewed
Reuterberg, Sven-Eric; Gustafsson, Jan-Eric – Educational and Psychological Measurement, 1992
The use of confirmatory factor analysis by the LISREL program is demonstrated as an assumption-testing method when computing reliability coefficients under different model assumptions. Results indicate that reliability estimates are robust against departure from the assumption of parallelism of test items. (SLD)
Descriptors: Equations (Mathematics), Estimation (Mathematics), Mathematical Models, Robustness (Statistics)
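The modelling assumptions at issue here (parallel, tau-equivalent, or congeneric items) have direct consequences for model-based reliability. The sketch below is not the authors' LISREL analysis; it only illustrates, with invented loadings and error variances, that coefficient omega computed from a one-factor model equals Cronbach's alpha when the loadings are equal (essential tau-equivalence) and exceeds it when they are not.

```python
# Sketch: model-based reliability (omega) from one-factor loadings versus Cronbach's
# alpha from the implied covariance matrix, under tau-equivalent and congeneric items.
import numpy as np

def omega(loadings, error_vars):
    lam = np.asarray(loadings, float)
    theta = np.asarray(error_vars, float)
    return lam.sum() ** 2 / (lam.sum() ** 2 + theta.sum())

def alpha(cov):
    k = cov.shape[0]
    return k / (k - 1) * (1 - np.trace(cov) / cov.sum())

def implied_cov(loadings, error_vars):
    lam = np.asarray(loadings, float)[:, None]
    return lam @ lam.T + np.diag(error_vars)

tau_equiv = ([0.7, 0.7, 0.7, 0.7], [0.5, 0.6, 0.4, 0.5])   # equal loadings
congeneric = ([0.9, 0.7, 0.5, 0.3], [0.5, 0.6, 0.4, 0.5])  # unequal loadings

for label, (lam, th) in [("tau-equivalent", tau_equiv), ("congeneric", congeneric)]:
    print(f"{label:15s} omega = {omega(lam, th):.3f}  alpha = {alpha(implied_cov(lam, th)):.3f}")
```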
Peer reviewed
Brown, James Dean – Language Testing, 1999
Explored the relative contributions of various numbers of persons, items, subtests, languages, and their interactions to the dependability of Test of English as a Foreign Language (TOEFL) scores. Sampled 15,000 test takers, 1,000 each from 15 different language backgrounds. (Author/VWL)
Descriptors: English (Second Language), Language Tests, Second Language Learning, Student Characteristics
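A minimal version of the kind of decomposition Brown reports is a persons-by-items generalizability study. The sketch below uses invented data and the standard two-way variance-component formulas (not the article's design, which also crosses subtests and language backgrounds) to estimate the components and a dependability coefficient.

```python
# Sketch: variance components and dependability for a persons x items G-study.
import numpy as np

rng = np.random.default_rng(4)
n_p, n_i = 500, 40

person = rng.normal(0, 1.0, (n_p, 1))   # person (universe-score) effects
item = rng.normal(0, 0.5, (1, n_i))     # item difficulty effects
scores = person + item + rng.normal(0, 0.8, (n_p, n_i))  # plus residual/interaction

grand = scores.mean()
ss_p = n_i * np.sum((scores.mean(axis=1) - grand) ** 2)
ss_i = n_p * np.sum((scores.mean(axis=0) - grand) ** 2)
ss_res = np.sum((scores - grand) ** 2) - ss_p - ss_i

ms_p = ss_p / (n_p - 1)
ms_i = ss_i / (n_i - 1)
ms_res = ss_res / ((n_p - 1) * (n_i - 1))

var_res = ms_res                       # person-by-item interaction plus error
var_p = (ms_p - ms_res) / n_i          # person variance component
var_i = (ms_i - ms_res) / n_p          # item variance component

g_coef = var_p / (var_p + var_res / n_i)              # relative (norm-referenced) decisions
phi = var_p / (var_p + (var_i + var_res) / n_i)       # absolute decisions (dependability)

print(f"variance components: person={var_p:.3f}, item={var_i:.3f}, residual={var_res:.3f}")
print(f"generalizability coefficient = {g_coef:.3f},  dependability (phi) = {phi:.3f}")
```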