ERIC - Search Results

Showing 91 to 105 of 1,166 results

A Response to "Assessment and Learning: Fields Apart?"

Peer reviewed

Goldstein, Harvey – Assessment in Education: Principles, Policy & Practice, 2017

The author's commentary focuses more on the original article's quantitative discussion of educational assessment than on the idea of assessment for learning, which did not raise any substantial issues. He starts by offering some general comments on the paper. He feels the authors made a number of assumptions about quantitative…

Descriptors: Educational Assessment, Statistical Analysis, International Assessment, Learning Theories

An Adaptive Test Analysis Based on Students' Motivation

Peer reviewed
PDF on ERIC

Yoshioka, Sérgio R. I.; Ishitani, Lucila – Informatics in Education, 2018

Computerized Adaptive Testing (CAT) is now widely used. However, inserting new items into the question bank of a CAT requires great effort, which makes the wide application of CAT in classroom teaching impractical. One solution would be to use the tacit knowledge of the teachers or experts for a pre-classification and calibrate during the…

Descriptors: Student Motivation, Adaptive Testing, Computer Assisted Testing, Item Response Theory

Commentary on Baird, J., Andrich, D., Hopfenbeck, T. N. and Stobart, G., "Assessment and Learning: Fields Apart"

Peer reviewed

Scharaschkin, Alex – Assessment in Education: Principles, Policy & Practice, 2017

This issue's featured article, "Assessment and Learning: Fields Apart" (Baird, Andrich, Hopfenbeck, and Stobart 2017) raises issues that are of basic importance for the disciplines of assessment and teaching and learning theory. In this commentary, Alex Scharaschkin restricts his remarks to a few areas. He considers the idea of a…

Descriptors: Educational Assessment, Learning Theories, Test Theory, Psychometrics

Accuracy of a Classical Test Theory-Based Procedure for Estimating the Reliability of a Multistage Test. Research Report. ETS RR-17-02

Peer reviewed
PDF on ERIC

Kim, Sooyeon; Livingston, Samuel A. – ETS Research Report Series, 2017

The purpose of this simulation study was to assess the accuracy of a classical test theory (CTT)-based procedure for estimating the alternate-forms reliability of scores on a multistage test (MST) having 3 stages. We generated item difficulty and discrimination parameters for 10 parallel, nonoverlapping forms of the complete 3-stage test and…

Descriptors: Accuracy, Test Theory, Test Reliability, Adaptive Testing

Analysis of Added Value of Subscores with Respect to Classification

Peer reviewed

Sinharay, Sandip – Journal of Educational Measurement, 2014

Brennan noted that users of test scores often want (indeed, demand) that subscores be reported, along with total test scores, for diagnostic purposes. Haberman suggested a method based on classical test theory (CTT) to determine if subscores have added value over the total score. One way to interpret the method is that a subscore has added value…

Descriptors: Scores, Test Theory, Classification, Cutting Scores

Using Generalizability Theory to Assess the Score Reliability of Communication Skills of Dentistry Students

Peer reviewed
PDF on ERIC

Uzun, N. Bilge; Aktas, Mehtap; Asiret, Semih; Yormaz, Seha – Asian Journal of Education and Training, 2018

The goal of this study is to determine the reliability of the performance points of dentistry students regarding communication skills and to examine the scoring reliability by generalizability theory in balanced random and fixed facet (mixed design) data, considering also the interactions of student, rater and duty. The study group of the research…

Descriptors: Foreign Countries, Generalizability Theory, Scores, Test Reliability

Language Testing: Current Practices and Future Developments

Peer reviewed

Tschirner, Erwin – Unterrichtspraxis/Teaching German, 2018

Concepts of second language proficiency and how proficiency may be assessed have changed considerably over the last 20 years. New notions of validity with respect to the interpretation and uses of test scores have begun to shape discussions about test validity and quality assurance in college world language departments, in government, and in…

Descriptors: Language Tests, Testing, Test Theory, German

Effects of Various Simulation Conditions on Latent-Trait Estimates: A Simulation Study

Peer reviewed
PDF on ERIC

Kogar, Hakan – International Journal of Assessment Tools in Education, 2018

The aim of this simulation study was to determine the relationship between true latent scores and estimated latent scores by including various control variables and different statistical models. The study also aimed to compare the statistical models and determine the effects of different distribution types, response formats and sample sizes on latent…

Descriptors: Simulation, Context Effect, Computation, Statistical Analysis

Determination of Differential Item Functioning (DIF) According to SIBTEST, Lord's [Chi-squared], Raju's Area Measurement and Breslow-Day Methods

Peer reviewed
PDF on ERIC

Ayva Yörü, Fatma Gökçen; Atar, Hakan Yavuz – Journal of Pedagogical Research, 2019

The aim of this study is to examine whether the items in the mathematics subtest of the Centralized High School Entrance Placement Test [HSEPT] administered in 2012 by the Ministry of National Education in Turkey show DIF according to gender and type of school. For this purpose, SIBTEST, Breslow-Day, Lord's [chi-squared] and Raju's area…

Descriptors: Test Bias, Mathematics Tests, Test Items, Gender Differences

An Extension of IRT-Based Equating to the Dichotomous Testlet Response Theory Model

Peer reviewed

Tao, Wei; Cao, Yi – Applied Measurement in Education, 2016

Current procedures for equating number-correct scores using traditional item response theory (IRT) methods assume local independence. However, when tests are constructed using testlets, one concern is the violation of the local item independence assumption. The testlet response theory (TRT) model is one way to accommodate local item dependence.…

Descriptors: Item Response Theory, Equated Scores, Test Format, Models

"TechCheck": Development and Validation of an Unplugged Assessment of Computational Thinking in Early Childhood Education

Peer reviewed

Relkin, Emily; de Ruiter, Laura; Bers, Marina Umaschi – Journal of Science Education and Technology, 2020

There is a need for developmentally appropriate Computational Thinking (CT) assessments that can be implemented in early childhood classrooms. We developed a new instrument called "TechCheck" for assessing CT skills in young children that does not require prior knowledge of computer programming. "TechCheck" is based on…

Descriptors: Developmentally Appropriate Practices, Computation, Thinking Skills, Early Childhood Education

ITC Guidelines on Quality Control in Scoring, Test Analysis, and Reporting of Test Scores

Peer reviewed

Allalouf, Avi – International Journal of Testing, 2014

The Quality Control (QC) Guidelines are intended to increase the efficiency, precision, and accuracy of the scoring, analysis, and reporting process of testing. The QC Guidelines focus on large-scale testing operations where multiple forms of tests are created for use on set dates. However, they may also be used for a wide variety of other testing…

Descriptors: Quality Control, Scoring, Test Theory, Scores

Item Construction Using Reflective, Formative, or Rasch Measurement Models: Implications for Group Work

Peer reviewed

Peterson, Christina Hamme; Gischlar, Karen L.; Peterson, N. Andrew – Journal for Specialists in Group Work, 2017

Measures that accurately capture the phenomenon are critical to research and practice in group work. The vast majority of group-related measures were developed using the reflective measurement model rooted in classical test theory (CTT). Depending on the construct definition and the measure's purpose, the reflective model may not always be the…

Descriptors: Item Response Theory, Group Activities, Test Theory, Test Items

Facilitating the Interpretation of English Language Proficiency Scores: Combining Scale Anchoring and Test Score Mapping Methodologies

Peer reviewed

Powers, Donald; Schedl, Mary; Papageorgiou, Spiros – Language Testing, 2017

The aim of this study was to develop, for the benefit of both test takers and test score users, enhanced "TOEFL ITP"® test score reports that go beyond the simple numerical scores that are currently reported. To do so, we applied traditional scale anchoring (proficiency scaling) to item difficulty data in order to develop performance…

Descriptors: English (Second Language), Second Language Learning, Language Proficiency, Scores

Relationships among Classical Test Theory and Item Response Theory Frameworks via Factor Analytic Models

Peer reviewed

Kohli, Nidhi; Koran, Jennifer; Henn, Lisa – Educational and Psychological Measurement, 2015

There are well-defined theoretical differences between the classical test theory (CTT) and item response theory (IRT) frameworks. It is understood that in the CTT framework, person and item statistics are test- and sample-dependent. This is not the perception with IRT. For this reason, the IRT framework is considered to be theoretically superior…

Descriptors: Test Theory, Item Response Theory, Factor Analysis, Models
