ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	6
Since 2016 (last 10 years)	17
Since 2006 (last 20 years)	28

Descriptor

Error of Measurement	53
Test Items	53
Test Reliability	53
Difficulty Level	16
Item Response Theory	16
Test Validity	16
Item Analysis	13
Scores	13
Test Construction	13
Mathematical Models	12
Foreign Countries	9
Psychometrics	8
Simulation	8
Test Length	8
Comparative Analysis	7
Correlation	7
Goodness of Fit	7
Scoring	7
Computer Assisted Testing	6
Sample Size	6
Test Theory	6
Adaptive Testing	5
Computation	5
Cutting Scores	5
Latent Trait Theory	5
More ▼

Publication Type

Reports - Research	39
Journal Articles	31
Reports - Evaluative	8
Speeches/Meeting Papers	7
Reports - Descriptive	4
Numerical/Quantitative Data	3
Dissertations/Theses -…	2
Tests/Questionnaires	1

Education Level

Elementary Education	5
Early Childhood Education	3
Elementary Secondary Education	3
Primary Education	3
Grade 3	2
High Schools	2
Higher Education	2
Middle Schools	2
Postsecondary Education	2
Secondary Education	2
Grade 4	1
Grade 7	1
Grade 9	1
Intermediate Grades	1
Junior High Schools	1
More ▼

Audience

Location

Canada	3
Indonesia	3
Germany	2
Netherlands	2
New Mexico	2
Florida	1
France	1
Maine	1
Malaysia	1
South Africa	1
South Carolina	1
South Korea	1
Spain	1
United Kingdom (Wales)	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Comprehensive Tests of Basic…	2
Armed Forces Qualification…	1
Expressive One Word Picture…	1
Peabody Picture Vocabulary…	1
Test of English as a Foreign…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 53 results Save | Export

How to Obtain the Most Error-Free Estimate of Reliability? Eight Sources of Deflation in the Estimates of Reliability to Avoid

Peer reviewed
PDF on ERIC

Download full text

Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2022

The reliability of a test score is usually underestimated and the deflation may be profound, 0.40 - 0.60 units of reliability or 46 - 71%. Eight root sources of the deflation are discussed and quantified by a simulation with 1,440 real-world datasets: (1) errors in the measurement modelling, (2) inefficiency in the estimator of reliability within…

Descriptors: Test Reliability, Scores, Test Items, Correlation

The Effect of Multiple-Choice Test Items' Difficulty Degree on the Reliability Coefficient and the Standard Error of Measurement Depending on the Item Response Theory (IRT)

Peer reviewed
PDF on ERIC

Download full text

Al-zboon, Habis Saad; Alrekebat, Amjad Farhan – International Journal of Higher Education, 2021

This study aims at identifying the effect of multiple-choice test items' difficulty degree on the reliability coefficient and the standard error of measurement depending on the item response theory IRT. To achieve the objectives of the study, (WinGen3) software was used to generate the IRT parameters (difficulty, discrimination, guessing) for four…

Descriptors: Multiple Choice Tests, Test Items, Difficulty Level, Error of Measurement

The Factor Structure and Measurement Invariance of the Autism Spectrum Quotient-28: A Cross-Cultural Comparison between Malaysia and the Netherlands

Peer reviewed

Direct link

Zhong Jian Chee; Anke M. Scheeren; Marieke de Vries – Autism: The International Journal of Research and Practice, 2024

Despite several psychometric advantages over the 50-item Autism Spectrum Quotient, an instrument used to measure autistic traits, the abridged AQ-28 and its cross-cultural validity have not been examined as extensively. Therefore, this study aimed to examine the factor structure and measurement invariance of the AQ-28 in 818 Dutch (M[subscript…

Descriptors: Autism Spectrum Disorders, Questionnaires, Factor Structure, Factor Analysis

Measuring Language Ability of Students with Compensatory Multidimensional CAT: A Post-Hoc Simulation Study

Peer reviewed

Direct link

Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022

The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…

Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency

Psychometric Properties of the French and German Versions of the Physical Self-Concept Questionnaire for Elementary School Children-Revised (PSCQ-C-R)

Peer reviewed

Direct link

Maïano, Christophe; Thibault, Isabelle; Dreiskämper, Dennis; Henning, Lena; Tietjens, Maike; Aimé, Annie – Measurement in Physical Education and Exercise Science, 2023

The present study sought to examine the psychometric properties of the French and German versions of the Physical Self-Concept Questionnaire for Elementary School Children-Revised (PSCQ-C-R). A sample of 519 children participated in this study. Of those, 197 were French-Canadian and 322 were German. Results support the factor validity and…

Descriptors: Elementary School Students, Self Concept, Human Body, Questionnaires

Precision of Single-Skill Math CBM Time-Series Data: The Effect of Probe Stratification and Set Size

Peer reviewed

Direct link

Solomon, Benjamin G.; Payne, Lexy L.; Campana, Kayla V.; Marr, Erin A.; Battista, Carmela; Silva, Alex; Dawes, Jillian M. – Journal of Psychoeducational Assessment, 2020

Comparatively little research exists on single-skill math (SSM) curriculum-based measurements (CBMs) for the purpose of monitoring growth, as may be done in practice or when monitoring intervention effectiveness within group or single-case research. Therefore, we examined a common variant of SSM-CBM: 1 digit × 1 digit multiplication. Reflecting…

Descriptors: Curriculum Based Assessment, Mathematics Tests, Mathematics Skills, Multiplication

Bayesian Approaches to Test Score Measurement Errors in Student Growth Prediction Models

Direct link

Pei-Hsuan Chiu – ProQuest LLC, 2018

Evidence of student growth is a primary outcome of interest for educational accountability systems. When three or more years of student test data are available, questions around how students grow and what their predicted growth is can be answered. Given that test scores contain measurement error, this error should be considered in growth and…

Descriptors: Bayesian Statistics, Scores, Error of Measurement, Growth Models

When near Means Related: Evidence from Three Web Survey Experiments on Inter-Item Correlations in Grid Questions

Peer reviewed

Direct link

Silber, Henning; Roßmann, Joss; Gummer, Tobias – International Journal of Social Research Methodology, 2018

In this article, we present the results of three question design experiments on inter-item correlations, which tested a grid design against a single-item design. The first and second experiments examined the inter-item correlations of a set with five and seven items, respectively, and the third experiment examined the impact of the question design…

Descriptors: Foreign Countries, Online Surveys, Experiments, Correlation

Student Perceptions of Teaching Quality in Five Countries: A Partial Credit Model Approach to Assess Measurement Invariance

Peer reviewed

Direct link

van der Lans, Rikkert M.; Maulana, Ridwan; Helms-Lorenz, Michelle; Fernández-García, Carmen-María; Chun, Seyeoung; de Jager, Thelma; Irnidayanti, Yulia; Inda-Caro, Mercedes; Lee, Okhwa; Coetzee, Thys; Fadhilah, Nurul; Jeon, Meae; Moorer, Peter – SAGE Open, 2021

This study examines measurement invariance of student perceptions of teaching quality collected in five countries: Indonesia (n students = 6,331), the Netherlands (n students = 6,738), South Africa (n students = 3,422), South Korea (n students = 6,997) and Spain (n students = 4,676). The administered questionnaire was the My Teacher Questionnaire…

Descriptors: Foreign Countries, Student Attitudes, Student Evaluation of Teacher Performance, Teacher Effectiveness

Developing IRT-Based Physics Critical Thinking Skill Test: A CAT to Answer 21st Century Challenge

Peer reviewed
PDF on ERIC

Download full text

Istiyono, Edi; Dwandaru, Wipsar Sunu Brams; Lede, Yulita Adelfin; Rahayu, Farida; Nadapdap, Amipa – International Journal of Instruction, 2019

The objective of this study was to develop Physics critical thinking skill test using computerized adaptive test (CAT) based on item response theory (IRT). This research was a development research using 4-D (define, design, develop, and disseminate). The content validity of the items was proven using Aiken's V. The test trial involved 252 students…

Descriptors: Critical Thinking, Thinking Skills, Cognitive Tests, Physics

Development and Initial Field Test of the 2016 K-TEEM (Knowledge for Teaching Early Elementary Mathematics) Test. Research Report No. 2019-01

Download full text

Direct link

Schoen, Robert C.; Yang, Xiaotong; Tazaz, Amanda M.; Bray, Wendy S.; Farina, Kristy – Grantee Submission, 2019

The "2016 Knowledge for Teaching Early Elementary Mathematics" (2016 K-TEEM) test measures teachers' mathematical knowledge for teaching early elementary mathematics. The 2016 K-TEEM is the third version of the K-TEEM (Schoen, Bray, Wolfe, Tazaz, & Nielsen, 2017). In this report, we present results of the first large-scale field test…

Descriptors: Test Construction, Elementary School Mathematics, Elementary School Teachers, Knowledge Base for Teaching

Effects of Differential Item Functioning on Examinees' Test Performance and Reliability of Test

Peer reviewed

Direct link

Lee, Yi-Hsuan; Zhang, Jinming – International Journal of Testing, 2017

Simulations were conducted to examine the effect of differential item functioning (DIF) on measurement consequences such as total scores, item response theory (IRT) ability estimates, and test reliability in terms of the ratio of true-score variance to observed-score variance and the standard error of estimation for the IRT ability parameter. The…

Descriptors: Test Bias, Test Reliability, Performance, Scores

Item Response Theory: An Introduction to Latent Trait Models to Test and Item Development

Peer reviewed
PDF on ERIC

Download full text

Bichi, Ado Abdu; Talib, Rohaya – International Journal of Evaluation and Research in Education, 2018

Testing in educational system perform a number of functions, the results from a test can be used to make a number of decisions in education. It is therefore well accepted in the education literature that, testing is an important element of education. To effectively utilize the tests in educational policies and quality assurance its validity and…

Descriptors: Item Response Theory, Test Items, Test Construction, Decision Making

Psychometric Report on the Knowledge for Teaching Elementary Fractions Test Administered to Elementary Educators in Six States in Spring 2017. Research Report No. 2018-13

Download full text

Schoen, Robert C.; Yang, Xiaotong; Paek, Insu – Grantee Submission, 2018

This report provides evidence of the substantive and structural validity of the Knowledge for Teaching Elementary Fractions Test. Field-test data were gathered with a sample of 241 elementary educators, including teachers, administrators, and instructional support personnel, in spring 2017, as part of a larger study involving a multisite…

Descriptors: Psychometrics, Pedagogical Content Knowledge, Mathematics Tests, Mathematics Instruction

Examination of Polytomous Items' Psychometric Properties According to Nonparametric Item Response Theory Models in Different Test Conditions

Peer reviewed
PDF on ERIC

Download full text

Sengul Avsar, Asiye; Tavsancil, Ezel – Educational Sciences: Theory and Practice, 2017

This study analysed polytomous items' psychometric properties according to nonparametric item response theory (NIRT) models. Thus, simulated datasets--three different test lengths (10, 20 and 30 items), three sample distributions (normal, right and left skewed) and three samples sizes (100, 250 and 500)--were generated by conducting 20…

Descriptors: Test Items, Psychometrics, Nonparametric Statistics, Item Response Theory

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

Educational and Psychological…	5
Grantee Submission	3
Applied Psychological…	2
New Mexico Public Education…	2
ProQuest LLC	2
Applied Measurement in…	1
Assessment & Evaluation in…	1
Autism: The International…	1
Behavioral Research and…	1
EURASIA Journal of…	1
Education and Information…	1
Educational Research and…	1
Educational Sciences: Theory…	1
Evaluation and the Health…	1
IEEE Transactions on Education	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
Journal of Education and…	1
Journal of Educational…	1
Journal of Educational…	1
Journal of Psychoeducational…	1
Language Assessment Quarterly	1
More ▼

Huynh, Huynh	3
Schoen, Robert C.	3
Yang, Xiaotong	3
Paek, Insu	2
Patience, Wayne M.	2
Reckase, Mark D.	2
Saunders, Joseph C.	2
Ackerman, Terry A.	1
Aimé, Annie	1
Al-zboon, Habis Saad	1
Alonzo, Julie	1
Alrekebat, Amjad Farhan	1
Altepeter, Tom	1
Anke M. Scheeren	1
Bates, Simon P.	1
Battista, Carmela	1
Benson, Jeri	1
Bichi, Ado Abdu	1
Bock, R. Darrell	1
Bray, Wendy S.	1
Brennan, Robert L.	1
Bristow, M.	1
Burton, Richard F.	1
Campana, Kayla V.	1
More ▼