ERIC - Search Results

Publication Date

In 2025	4
Since 2024	8
Since 2021 (last 5 years)	48
Since 2016 (last 10 years)	122
Since 2006 (last 20 years)	170

Descriptor

Item Response Theory	202
Test Construction	202
Test Reliability	171
Test Validity	120
Test Items	78
Psychometrics	56
Foreign Countries	55
Scoring	34
Scores	33
Measures (Individuals)	32
Item Analysis	27
Difficulty Level	25
Construct Validity	24
High School Students	23
Reliability	23
Scaling	23
Student Evaluation	23
Test Bias	23
Elementary School Students	20
Multiple Choice Tests	20
Grade 3	19
Mathematics Tests	19
Factor Analysis	17
Grade 5	17
Computer Assisted Testing	16
More ▼

Publication Type

Journal Articles	131
Reports - Research	127
Reports - Evaluative	40
Speeches/Meeting Papers	22
Numerical/Quantitative Data	20
Reports - Descriptive	20
Tests/Questionnaires	15
Dissertations/Theses -…	9
Books	3
Collected Works - General	3
Information Analyses	2
Non-Print Media	1
Opinion Papers	1
Reference Materials - General	1
More ▼

Education Level

Higher Education	45
Elementary Education	44
Postsecondary Education	39
Secondary Education	38
Middle Schools	27
High Schools	23
Junior High Schools	22
Early Childhood Education	21
Primary Education	19
Elementary Secondary Education	18
Grade 5	16
Grade 3	15
Grade 4	13
Intermediate Grades	13
Grade 6	11
Grade 7	9
Grade 8	8
Grade 1	6
Grade 2	6
Grade 9	6
Kindergarten	5
Adult Education	2
Grade 10	2
Grade 11	2
Grade 12	2
More ▼

Audience

Practitioners	1
Researchers	1

Location

New York	6
Turkey	6
Australia	5
Indonesia	5
Florida	4
China	3
Germany	3
Jordan	3
Malaysia	3
Nigeria	3
Taiwan	3
United States	3
California	2
Canada	2
Hong Kong	2
Iran	2
Netherlands	2
New Mexico	2
South Africa	2
Texas	2
United Kingdom (England)	2
Alabama	1
Chile (Santiago)	1
Denmark	1
Hawaii	1
More ▼

Laws, Policies, & Programs

Individuals with Disabilities…	1
No Child Left Behind Act 2001	1

What Works Clearinghouse Rating

Showing 1 to 15 of 202 results Save | Export

Detecting Differential Item Functioning among Multiple Groups Using IRT Residual DIF Framework

Peer reviewed

Direct link

Hwanggyu Lim; Danqi Zhu; Edison M. Choe; Kyung T. Han – Journal of Educational Measurement, 2024

This study presents a generalized version of the residual differential item functioning (RDIF) detection framework in item response theory, named GRDIF, to analyze differential item functioning (DIF) in multiple groups. The GRDIF framework retains the advantages of the original RDIF framework, such as computational efficiency and ease of…

Descriptors: Item Response Theory, Test Bias, Test Reliability, Test Construction

Exploring the Influence of Response Styles on Continuous Scale Assessments: Insights from a Novel Modeling Approach

Peer reviewed

Direct link

Hung-Yu Huang – Educational and Psychological Measurement, 2025

The use of discrete categorical formats to assess psychological traits has a long-standing tradition that is deeply embedded in item response theory models. The increasing prevalence and endorsement of computer- or web-based testing has led to greater focus on continuous response formats, which offer numerous advantages in both respondent…

Descriptors: Response Style (Tests), Psychological Characteristics, Item Response Theory, Test Reliability

How Many Response Categories Are Sufficient for Likert Type Scales? An Empirical Study Based on the Item Response Theory

Peer reviewed
PDF on ERIC

Download full text

Aybek, Eren Can; Toraman, Cetin – International Journal of Assessment Tools in Education, 2022

The current study investigates the optimum number of response categories for the Likert type of scales under the item response theory (IRT). The data was collected from university students attend to mainly the faculty of medicine and the faculty of education. A form of the "Social Gender Equity Scale" developed by Gozutok et al. (2017)…

Descriptors: Likert Scales, Item Response Theory, College Students, Test Reliability

Practical Considerations in Choosing an Anchor Test Form for Equating under the Random Groups Design

Peer reviewed

Direct link

Cui, Zhongmin; He, Yong – Measurement: Interdisciplinary Research and Perspectives, 2023

Careful considerations are necessary when there is a need to choose an anchor test form from a list of old test forms for equating under the random groups design. The choice of the anchor form potentially affects the accuracy of equated scores on new test forms. Few guidelines, however, can be found in the literature on choosing the anchor form.…

Descriptors: Test Format, Equated Scores, Best Practices, Test Construction

Development of Pedagogical Competence Scale for Lecturers in Universities Using Item Response Theory

Peer reviewed
PDF on ERIC

Download full text

Olagunju, Barakat Adeoti; Iwintolu, Rukayat Oyebola – Elementary School Forum (Mimbar Sekolah Dasar), 2023

To ascertain whether lecturers had the pedagogical competency skill needed in impacting students with necessary skills they need outside school, this study developed a scale to measure Lecturers' Pedagogical Competence (LPCS) in universities using item response theory. The pedagogical competence of university lecturers was assessed using a set of…

Descriptors: Test Construction, Teacher Competencies, Measures (Individuals), College Faculty

The Relationship of Special Education Teacher Performance on Observation Instruments with Student Outcomes

Peer reviewed
PDF on ERIC

Download full text

Direct link

Johnson, Evelyn S.; Zheng, Yuzhu; Crawford, Angela R.; Moylan, Laura A. – Journal of Learning Disabilities, 2021

In this study, we examined the relationship of special education teachers' performance on the Recognizing Effective Special Education Teachers (RESET) Explicit Instruction observation protocol with student growth on academic measures. Special education teachers provided video-recorded observations of three instructional lessons along with data…

Descriptors: Special Education Teachers, Teacher Effectiveness, Teacher Evaluation, Direct Instruction

Development of Gazi Functional Vision Assessment Instrument

Peer reviewed
PDF on ERIC

Download full text

Safak, Pinar; Cakmak, Salih; Karakoc, Tamer; Aydin O'Dwyer, Pinar – European Journal of Educational Research, 2021

This study aimed to develop a valid and reliable instrument that measures the functional vision of students with low vision. Thus, an assessment tool and performance activities were developed for three vision skill groups (near vision skills, distance vision skills, and visual field) that include functional vision skills. The universe was 1485…

Descriptors: Foreign Countries, Vision Tests, Diagnostic Tests, Vision

Investigating Constructed-Response Scoring over Time: The Effects of Study Design on Trend Rescore Statistics. Research Report. ETS RR-22-15

Peer reviewed
PDF on ERIC

Download full text

Donoghue, John R.; McClellan, Catherine A.; Hess, Melinda R. – ETS Research Report Series, 2022

When constructed-response items are administered for a second time, it is necessary to evaluate whether the current Time B administration's raters have drifted from the scoring of the original administration at Time A. To study this, Time A papers are sampled and rescored by Time B scorers. Commonly the scores are compared using the proportion of…

Descriptors: Item Response Theory, Test Construction, Scoring, Testing

A Novel Examination of None-of-the-Above as It Influences Examinee Item Responses

Direct link

Thompson, Kathryn N. – ProQuest LLC, 2023

It is imperative to collect validity evidence prior to interpreting and using test scores. During the process of collecting validity evidence, test developers should consider whether test scores are contaminated by sources of extraneous information. This is referred to as construct irrelevant variance, or the "degree to which test scores are…

Descriptors: Test Wiseness, Test Items, Item Response Theory, Scores

Development and Validation of a Survey Instrument for Measuring Pre-Service Teachers' Pedagogical Content Knowledge

Peer reviewed

Direct link

Martin, David; Jamieson-Proctor, Romina – International Journal of Research & Method in Education, 2020

In Australia, one of the key findings of the Teacher Education Ministerial Advisory Group was that not all graduating pre-service teachers possess adequate pedagogical content knowledge (PCK) to teach effectively. The concern is that higher education providers working with pre-service teachers are using pedagogical practices and assessments which…

Descriptors: Test Construction, Preservice Teachers, Pedagogical Content Knowledge, Foreign Countries

Development and Validation of a Reading in Science Holistic Assessment (RISHA): A Rasch Measurement Study

Peer reviewed

Direct link

Kason Ka Ching Cheung; Jack K. H. Pun; Xuehua Fu – International Journal of Science and Mathematics Education, 2024

Researchers in science education lacks valid and reliable instruments to assess students' "disciplinary" and "epistemic" reading of scientific texts. The main purpose of this study was to develop and validate a Reading in Science Holistic Assessment (RISHA) to assess students' holistic reading of scientific texts. RISHA…

Descriptors: Test Construction, Reading Tests, Science Education, Student Evaluation

Assessing Tonal Abilities in Elementary School Children: Testing Reliability and Validity of the Implicit Tonal Ability Test Using Rasch Measurement Model

Peer reviewed

Direct link

Zyxcban G. Wolfs; Saskia Brand-Gruwel; Henny P. A. Boshuizen – SAGE Open, 2023

The objective of this study was to develop and validate an instrument measuring the perception and interpretation of several distinct musical features (pitch, tonality, timing, loudness, and timbre). Therefore, we developed the Implicit Tonal Ability Test (ITAT), a listening test containing 49 multiple-choice items. A total of 233 children aged 6…

Descriptors: Elementary School Students, Test Validity, Test Reliability, Age Differences

A Two-Level Adaptive Test Battery

Peer reviewed

Direct link

Wim J. van der Linden; Luping Niu; Seung W. Choi – Journal of Educational and Behavioral Statistics, 2024

A test battery with two different levels of adaptation is presented: a within-subtest level for the selection of the items in the subtests and a between-subtest level to move from one subtest to the next. The battery runs on a two-level model consisting of a regular response model for each of the subtests extended with a second level for the joint…

Descriptors: Adaptive Testing, Test Construction, Test Format, Test Reliability

The Relationship of Special Education Teacher Performance on Observation Instruments with Student Outcomes

Peer reviewed
PDF on ERIC

Download full text

Direct link

Johnson, Evelyn S.; Zheng, Yuzhu; Crawford, Angela R.; Moylan, Laura A. – Grantee Submission, 2020

In this study, we examined the relationship of special education teachers' performance on the RESET Explicit Instruction observation protocol with student growth on academic measures. Special education teachers provided video recorded observations of three instructional lessons along with data from standardized, curriculum-based academic measures…

Descriptors: Special Education Teachers, Teacher Effectiveness, Teacher Evaluation, Direct Instruction

Development and Validation of a Computational Thinking Test for Lower Primary School Students

Peer reviewed

Direct link

Zhang, Shuhan; Wong, Gary K. W. – Educational Technology Research and Development, 2023

Computational thinking (CT) has permeated primary and early childhood education in recent years. Despite the extensive effort in CT learning initiatives, few age-appropriate assessment tools targeting young children have been developed. In this study, we proposed Computational Thinking Test for Lower Primary (CTtLP), which was designed for lower…

Descriptors: Computation, Thinking Skills, Elementary School Students, Test Construction

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 14

ProQuest LLC	9
Grantee Submission	7
Behavioral Research and…	5
International Journal of…	5
Journal of Educational…	5
New York State Education…	5
Online Submission	5
Applied Measurement in…	4
International Journal of…	4
Partnership for Assessment of…	4
Applied Psychological…	3
Educational Assessment	3
Educational Measurement:…	3
Measurement and Evaluation in…	3
Measurement in Physical…	3
SAGE Open	3
AERA Online Paper Repository	2
CBE - Life Sciences Education	2
College Board	2
ETS Research Report Series	2
EURASIA Journal of…	2
Education and Information…	2
Educational Assessment,…	2
Educational and Psychological…	2
Elementary School Journal	2
More ▼

Schoen, Robert C.	6
Alonzo, Julie	5
Tindal, Gerald	5
Bauduin, Charity	4
Irvin, P. Shawn	4
Lai, Cheng-Fei	4
Park, Bitnara Jasmine	4
Anderson, Daniel	3
Wainer, Howard	3
Adam M. Voight	2
Andres Pinedo	2
Avery, Marybell	2
Aybek, Eren Can	2
Biancarosa, Gina	2
Bichi, Ado Abdu	2
Bray, Wendy	2
Carlson, Sarah E.	2
Crawford, Angela R.	2
Davison, Mark L.	2
Dyson, Ben	2
Elise Harris	2
Emanuele Bardelli	2
Fisette, Jennifer L.	2
Fox, Connie	2
More ▼

SAT (College Admission Test)	7
Iowa Tests of Basic Skills	2
Bayley Scales of Infant…	1
Behavior Assessment System…	1
Early Childhood Longitudinal…	1
General Educational…	1
Graduate Record Examinations	1
Hidden Figures Test	1
Home Observation for…	1
Interpersonal Reactivity Index	1
Iowa Tests of Educational…	1
Kaufman Test of Educational…	1
Law School Admission Test	1
Motivated Strategies for…	1
NEO Five Factor Inventory	1
National Assessment of…	1
Peabody Picture Vocabulary…	1
Raven Advanced Progressive…	1
Remote Associates Test	1
Social Skills Improvement…	1
Social Skills Rating System	1
Test of Standard Written…	1
Vineland Adaptive Behavior…	1
Wechsler Adult Intelligence…	1
Woodcock Johnson Tests of…	1
More ▼