ERIC - Search Results

Publication Date

In 2025	3
Since 2024	12
Since 2021 (last 5 years)	41
Since 2016 (last 10 years)	126
Since 2006 (last 20 years)	395

Descriptor

Test Theory	1161
Test Items	261
Test Reliability	252
Test Construction	245
Test Validity	245
Psychometrics	181
Scores	176
Item Response Theory	165
Foreign Countries	159
Item Analysis	141
Statistical Analysis	134
Higher Education	132
Mathematical Models	132
Measurement Techniques	123
Comparative Analysis	121
Correlation	114
Error of Measurement	113
Latent Trait Theory	112
Test Interpretation	112
Testing	111
Evaluation Methods	106
Models	98
Testing Problems	93
Elementary Secondary Education	90
Multiple Choice Tests	85
More ▼

Education Level

Higher Education	95
Postsecondary Education	65
Secondary Education	48
Elementary Education	39
Elementary Secondary Education	29
Middle Schools	27
High Schools	24
Junior High Schools	22
Grade 8	18
Grade 7	14
Grade 4	13
Grade 6	11
Adult Education	10
Early Childhood Education	10
Grade 5	10
Intermediate Grades	10
Grade 3	9
Primary Education	6
Grade 2	4
Preschool Education	4
Grade 10	3
Grade 9	3
Kindergarten	3
Grade 1	2
Grade 12	2
More ▼

Audience

Researchers	81
Practitioners	42
Teachers	22
Students	6
Administrators	5
Policymakers	4
Counselors	2

Location

United States	17
United Kingdom (England)	15
Canada	14
Australia	13
Turkey	12
Sweden	8
United Kingdom	8
Netherlands	7
Texas	7
New York	6
Taiwan	6
United Kingdom (Great Britain)	6
Florida	5
Japan	5
Spain	5
Tennessee	5
United Kingdom (Wales)	5
California	4
Colorado	4
Israel	4
Chile	3
China	3
Germany	3
Illinois	3
Indonesia	3
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	4
Elementary and Secondary…	3
Individuals with Disabilities…	3

What Works Clearinghouse Rating

Showing 181 to 195 of 1,161 results Save | Export

Explaining Crossing DIF in Polytomous Items Using Differential Step Functioning Effects

Peer reviewed

Direct link

Penfield, Randall D. – Applied Psychological Measurement, 2010

Crossing, or intersecting, differential item functioning (DIF) is a form of nonuniform DIF that exists when the sign of the between-group difference in expected item performance changes across the latent trait continuum. The presence of crossing DIF presents a problem for many statistics developed for evaluating DIF because positive and negative…

Descriptors: Test Bias, Test Items, Statistics, Test Theory

Test Theories, Educational Priorities and Reliability of Public Examinations in England

Peer reviewed

Direct link

Baird, Jo-Anne; Black, Paul – Research Papers in Education, 2013

Much has already been written on the controversies surrounding the use of different test theories in educational assessment. Other authors have noted the prevalence of classical test theory over item response theory in practice. This Special Issue draws together articles based upon work conducted on the Reliability Programme for England's…

Descriptors: Test Theory, Foreign Countries, Test Reliability, Item Response Theory

Impact of Psychometric Decisions on Assessment Outcomes in an Alternate Assessment

Direct link

Rao, Vasanthi – ProQuest LLC, 2012

In 1997, based on the amendments to Individuals with Disabilities Education Act (IDEA), all states were faced with a statutory requirement to develop and implement alternate assessments for students with disabilities unable to participate in the statewide large-scale assessment. States were given the challenge of creating, implementing, and…

Descriptors: Alternative Assessment, Psychometrics, Item Response Theory, Models

Development and Validation of the Star Properties Concept Inventory

Peer reviewed

Direct link

Bailey, Janelle M.; Johnson, Bruce; Prather, Edward E.; Slater, Timothy F. – International Journal of Science Education, 2012

Concept inventories (CIs)--typically multiple-choice instruments that focus on a single or small subset of closely related topics--have been used in science education for more than a decade. This paper describes the development and validation of a new CI for astronomy, the "Star Properties Concept Inventory" (SPCI). Questions cover the areas of…

Descriptors: Educational Strategies, Validity, Testing, Astronomy

Measuring Teaching Best Practice in the Induction Years: Development and Validation of an Item-Level Assessment

Peer reviewed
PDF on ERIC

Download full text

Kingsley, Laurie; Romine, William – European Journal of Educational Research, 2014

Schools and teacher induction programs around the world routinely assess teaching best practice to inform accreditation, tenure/promotion, and professional development decisions. Routine assessment is also necessary to ensure that teachers entering the profession get the assistance they need to develop and succeed. We introduce the Item-Level…

Descriptors: Test Construction, Test Validity, Beginning Teacher Induction, Best Practices

Development of Nonword and Irregular Word Lists for Australian Grade 3 Students Using Rasch Analysis

Peer reviewed

Direct link

Callinan, Sarah; Cunningham, Everarda; Theiler, Stephen – Australian Journal of Learning Difficulties, 2014

Many tests used in educational settings to identify learning difficulties endeavour to pick up only the lowest performers. Yet these tests are generally developed within a Classical Test Theory (CTT) paradigm that assumes that data do not have significant skew. Rasch analysis is more tolerant of skew and was used to validate two newly developed…

Descriptors: Foreign Countries, Reading Tests, Item Response Theory, Elementary School Students

Using Rasch Measurement to Score, Evaluate, and Improve Examinations in an Anatomy Course

Peer reviewed

Direct link

Royal, Kenneth D.; Gilliland, Kurt O.; Kernick, Edward T. – Anatomical Sciences Education, 2014

Any examination that involves moderate to high stakes implications for examinees should be psychometrically sound and legally defensible. Currently, there are two broad and competing families of test theories that are used to score examination data. The majority of instructors outside the high-stakes testing arena rely on classical test theory…

Descriptors: Item Response Theory, Scoring, Evaluation Methods, Anatomy

New Equating Methods and Their Relationships with Levine Observed Score Linear Equating under the Kernel Equating Framework

Peer reviewed

Direct link

Chen, Haiwen; Holland, Paul – Psychometrika, 2010

In this paper, we develop a new curvilinear equating for the nonequivalent groups with anchor test (NEAT) design under the assumption of the classical test theory model, that we name curvilinear Levine observed score equating. In fact, by applying both the kernel equating framework and the mean preserving linear transformation of…

Descriptors: Equated Scores, Test Theory, Test Construction, Guidelines

Generalizability Theory and Classical Test Theory

Peer reviewed

Direct link

Brennan, Robert L. – Applied Measurement in Education, 2011

Broadly conceived, reliability involves quantifying the consistencies and inconsistencies in observed scores. Generalizability theory, or G theory, is particularly well suited to addressing such matters in that it enables an investigator to quantify and distinguish the sources of inconsistencies in observed scores that arise, or could arise, over…

Descriptors: Generalizability Theory, Test Theory, Test Reliability, Item Response Theory

Comments on Neil Dorans's NCME Career Award Address: The Contestant Perspective on Taking Tests--Emanations from the Statue within

Peer reviewed

Direct link

Mislevy, Robert J. – Educational Measurement: Issues and Practice, 2012

This article presents the author's observations on Neil Dorans's NCME Career Award Address: "The Contestant Perspective on Taking Tests: Emanations from the Statue within." He calls attention to some points that Dr. Dorans made in his address, and offers his thoughts in response.

Descriptors: Testing, Test Reliability, Psychometrics, Scores

Assessing Probabilistic Reasoning in Verbal-Numerical and Graphical-Pictorial Formats: An Evaluation of the Psychometric Properties of an Instrument

Peer reviewed

Direct link

Agus, Mirian; Penna, Maria Pietronilla; Peró-Cebollero, Maribel; Guàrdia-Olmos, Joan – EURASIA Journal of Mathematics, Science & Technology Education, 2016

Research on the graphical facilitation of probabilistic reasoning has been characterised by the effort expended to identify valid assessment tools. The authors developed an assessment instrument to compare reasoning performances when problems were presented in verbal-numerical and graphical-pictorial formats. A sample of undergraduate psychology…

Descriptors: Probability, Abstract Reasoning, Thinking Skills, Educational Assessment

A Psychometric Analysis of the Chemical Concepts Inventory

Peer reviewed

Direct link

Barbera, Jack – Journal of Chemical Education, 2013

The Chemical Concepts Inventory (CCI) is a multiple-choice instrument designed to assess the alternate conceptions of students in high school or first-semester college chemistry. The instrument was published in 2002 along with an analysis of its data from a test population. This study supports the initial analysis and expands on the psychometric…

Descriptors: Science Instruction, Secondary School Science, High Schools, College Science

Measurement of Classroom Teaching Quality with Item Response Theory

Peer reviewed
PDF on ERIC

Download full text

Kelcey, Ben; McGinn, Daniel; Hill, Heather – Society for Research on Educational Effectiveness, 2013

Recent policy has charged schools and districts with maintaining highly qualified teachers and differentiating among teachers in terms of their effectiveness (U.S. Department of Education, 2009). This emphasis has driven the development and implementation of teacher quality measures which are increasingly being used to evaluate teachers with…

Descriptors: Teacher Effectiveness, Measures (Individuals), Observation, Teacher Evaluation

Should Global Items on Student Rating Scales Be Used for Summative Decisions?

Peer reviewed

Direct link

Berk, Ronald A. – Journal of Faculty Development, 2013

One of the simplest indicators of teaching or course effectiveness is student ratings on one or more global items from the entire rating scale. That approach seems intuitively sound and easy to use. Global items have even been recommended by a few researchers to get a quick-read, at-a-glance summary for summative decisions about faculty. The…

Descriptors: Rating Scales, Student Evaluation of Teacher Performance, Item Analysis, Test Items

The Examination of Reliability According to Classical Test and Generalizability on a Job Performance Scale

Peer reviewed
PDF on ERIC

Download full text

Yelboga, Atilla; Tavsancil, Ezel – Educational Sciences: Theory and Practice, 2010

In this research, the classical test theory and generalizability theory analyses were carried out with the data obtained by a job performance scale for the years 2005 and 2006. The reliability coefficients obtained (estimated) from the classical test theory and generalizability theory analyses were compared. In classical test theory, test retest…

Descriptors: Test Theory, Generalizability Theory, Job Performance, Measures (Individuals)

« Previous Page | Next Page »

Pages: 1 | ... | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | ... | 78

Educational and Psychological…	63
Psychometrika	48
Journal of Educational…	35
Applied Psychological…	34
ProQuest LLC	26
Educational Measurement:…	23
Language Testing	15
Measurement:…	15
Journal of Educational…	13
Online Submission	13
Assessment in Education:…	12
International Journal of…	12
Applied Measurement in…	10
International Journal of…	10
Journal of Educational and…	10
Journal of Experimental…	8
Alberta Journal of…	7
ETS Research Report Series	7
Journal of School Psychology	7
Annual Review of Applied…	6
Educational Research and…	6
Intelligence	6
Practical Assessment,…	6
School Psychology Review	6
Astronomy Education Review	5
More ▼

Mislevy, Robert J.	20
Zimmerman, Donald W.	15
van der Linden, Wim J.	15
Sinharay, Sandip	9
Andrich, David	8
Haladyna, Tom	7
Wilcox, Rand R.	7
Williams, Richard H.	7
Yen, Wendy M.	7
Brennan, Robert L.	6
Dorans, Neil J.	6
Haberman, Shelby J.	6
Holland, Paul W.	6
Huynh, Huynh	6
Prather, Edward E.	6
Wainer, Howard	6
Baird, Jo-Anne	5
Cliff, Norman	5
Petscher, Yaacov	5
Roid, Gale	5
Thompson, Bruce	5
Tindal, Gerald	5
Zumbo, Bruno D.	5
Engelhard, George, Jr.	4
More ▼

Journal Articles	728
Reports - Research	615
Reports - Evaluative	214
Speeches/Meeting Papers	187
Reports - Descriptive	120
Opinion Papers	113
Information Analyses	67
Dissertations/Theses -…	26
Guides - Non-Classroom	26
Tests/Questionnaires	26
Numerical/Quantitative Data	22
Books	13
Book/Product Reviews	11
Reference Materials -…	8
Collected Works - General	7
Guides - Classroom - Teacher	7
Collected Works - Proceedings	6
ERIC Publications	6
Guides - Classroom - Learner	6
Reports - General	5
Collected Works - Serials	4
Historical Materials	4
Dissertations/Theses -…	2
ERIC Digests in Full Text	2
Guides - General	2
More ▼

SAT (College Admission Test)	23
National Assessment of…	11
Wechsler Intelligence Scale…	11
Armed Services Vocational…	10
ACT Assessment	9
Graduate Record Examinations	7
Comprehensive Tests of Basic…	6
Test of English as a Foreign…	6
Program for International…	5
Trends in International…	5
California Achievement Tests	4
Kaufman Assessment Battery…	4
Stanford Binet Intelligence…	4
Bayley Scales of Infant…	3
Law School Admission Test	3
Stanford Achievement Tests	3
Strengths and Difficulties…	3
ACTFL Oral Proficiency…	2
Advanced Placement…	2
Alabama High School…	2
Childrens Depression Inventory	2
Eysenck Personality Inventory	2
General Aptitude Test Battery	2
Graduate Management Admission…	2
Learning and Study Strategies…	2
More ▼