ERIC - Search Results

Publication Date

In 2025	2
Since 2024	5
Since 2021 (last 5 years)	8
Since 2016 (last 10 years)	24
Since 2006 (last 20 years)	48

Descriptor

Correlation	51
Item Response Theory	51
Test Reliability	51
Test Validity	32
Test Items	20
Psychometrics	19
Foreign Countries	18
Scores	18
Factor Analysis	12
Comparative Analysis	10
Construct Validity	10
Difficulty Level	10
Statistical Analysis	10
Test Construction	9
College Students	8
Test Theory	8
Measures (Individuals)	7
Undergraduate Students	7
Elementary School Students	6
Goodness of Fit	6
Language Tests	6
Mathematics Tests	6
Second Language Learning	6
Achievement Tests	5
English (Second Language)	5
More ▼

Publication Type

Journal Articles	41
Reports - Research	36
Reports - Evaluative	7
Dissertations/Theses -…	4
Speeches/Meeting Papers	3
Reports - Descriptive	2
Guides - General	1
Non-Print Media	1
Numerical/Quantitative Data	1
Reference Materials - General	1

Education Level

Higher Education	17
Postsecondary Education	15
Elementary Education	8
Secondary Education	7
Elementary Secondary Education	5
High Schools	2
Intermediate Grades	2
Junior High Schools	2
Grade 4	1
Grade 6	1
Grade 7	1
Grade 8	1
Middle Schools	1
More ▼

Audience

Practitioners	1
Researchers	1

Location

Germany	3
Hong Kong	3
California	2
Taiwan	2
Texas	2
Arizona	1
China	1
Colombia	1
Colorado	1
Denmark	1
Florida	1
Illinois	1
Indonesia	1
Iran	1
Japan	1
Kansas	1
Kenya	1
Maryland	1
Nebraska (Lincoln)	1
Netherlands	1
New Hampshire	1
New York	1
Oklahoma	1
Saudi Arabia	1
Turkey	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Motivated Strategies for…	2
Test of English as a Foreign…	2
Trends in International…	2
Defining Issues Test	1
Graduate Record Examinations	1
Peabody Picture Vocabulary…	1
SAT (College Admission Test)	1
Stanford Achievement Tests	1
Students Evaluation of…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 51 results Save | Export

Exploring the Relationship between Motivation and Augmented Reality Presence Using the Augmented Reality Presence Scale (ARPS)

Peer reviewed

Direct link

Enrico Gandolfi; Richard E. Ferdig – Educational Technology Research and Development, 2025

Augmented Reality (AR) is increasingly being adopted in education to foster engagement and interest in a variety of subjects and content areas. However, there is a scarcity of instruments to measure the instructional impact of this innovation. This article addresses this gap in two unique ways. First, it presents validation results of the…

Descriptors: Simulated Environment, Measures (Individuals), Rating Scales, Item Response Theory

Separation of Traits and Extreme Response Style in IRTree Models: The Role of Mimicry Effects for the Meaningful Interpretation of Estimates

Peer reviewed

Direct link

Viola Merhof; Caroline M. Böhm; Thorsten Meiser – Educational and Psychological Measurement, 2024

Item response tree (IRTree) models are a flexible framework to control self-reported trait measurements for response styles. To this end, IRTree models decompose the responses to rating items into sub-decisions, which are assumed to be made on the basis of either the trait being measured or a response style, whereby the effects of such person…

Descriptors: Item Response Theory, Test Interpretation, Test Reliability, Test Validity

Modeling Local Item Dependence in Cloze Tests with the Rasch Model: Applying a New Strategy

Peer reviewed
PDF on ERIC

Download full text

Barno S. Abdullaeva; Diyorjon Abdullaev; Nurislom I. Khursanov; Khurshida B. Kadirova; Laylo Djuraeva – International Journal of Language Testing, 2024

Cloze tests are commonly used in language testing as a quick measure of overall language ability or reading comprehension. A problem for the analysis of cloze tests with item response theory models is that cloze test items are locally dependent. This leads to the violation of the conditional or local independence assumption of IRT models. In this…

Descriptors: Cloze Procedure, Language Tests, Test Items, Correlation

The Creation and Validation of the Arabic Vocabulary Levels Test (Arabic-VLT)

Peer reviewed

Direct link

Abdullah Alamer; Ahmed Al Khateeb; Abdulrahman Alshabeb – Language Assessment Quarterly, 2025

This study introduces the first Arabic Vocabulary Levels Test (Arabic-VLT), created for foreign learners of Arabic. We present compelling evidence to substantiate its validity and reliability. The Arabic-VLT was developed according to five levels, beginning with the most frequently used words (Level 1) to the least frequently used ones (Level 5),…

Descriptors: Arabic, Vocabulary Development, Test Construction, Second Language Learning

Assessing Behavioral Self-Regulation: A Validation Study

Peer reviewed

Direct link

Angelica Garzon Umerenkova; Jesus de la Fuente Arias – Electronic Journal of Research in Educational Psychology, 2024

Introduction: Self-regulation is the ability to adequately plan and manage one's own behavior in a flexible manner. It is a predictor of well-being, health, academic performance, among others. The psychometric characterization of the Self-Regulation Questionnaire-Abbreviated (CAR-abr.) composed of 17 items is presented. A versatile instrument,…

Descriptors: Self Control, Self Management, Questionnaires, Psychometrics

The Impact of Aberrant Response on Reliability and Validity

Peer reviewed

Direct link

Liu, Tour; Sun, Yicong; Li, Zhen; Xin, Tao – Measurement: Interdisciplinary Research and Perspectives, 2019

Aberrant response has an important impact on item parameter estimation, individuals' evaluation, and other statistical analysis. There are various types of aberrant response behaviors in educational and psychological tests, like sleeping, guessing, and plodding. Random response is the most common one. The purpose of this research was to clarify…

Descriptors: Test Reliability, Test Validity, Item Response Theory, Differences

Are Speeded Tests Unfair? Modeling the Impact of Time Limits on the Gender Gap in Mathematics

Peer reviewed

Direct link

Stoevenbelt, Andrea H.; Wicherts, Jelte M.; Flore, Paulette C.; Phillips, Lorraine A. T.; Pietschnig, Jakob; Verschuere, Bruno; Voracek, Martin; Schwabe, Inga – Educational and Psychological Measurement, 2023

When cognitive and educational tests are administered under time limits, tests may become speeded and this may affect the reliability and validity of the resulting test scores. Prior research has shown that time limits may create or enlarge gender gaps in cognitive and academic testing. On average, women complete fewer items than men when a test…

Descriptors: Timed Tests, Gender Differences, Item Response Theory, Correlation

Exploration of Student Cognitive Mathematics Ability Diagnostic Instruments: Validity, Reliability, and Item Characteristics

Peer reviewed
PDF on ERIC

Download full text

Hartono, Wahyu; Hadi, Samsul; Rosnawati, Raden; Retnawati, Heri – Pegem Journal of Education and Instruction, 2023

Researchers design diagnostic assessments to measure students' knowledge structures and processing skills to provide information about their cognitive attribute. The purpose of this study is to determine the instrument's validity and score reliability, as well as to investigate the use of classical test theory to identify item characteristics. The…

Descriptors: Diagnostic Tests, Test Validity, Item Response Theory, Content Validity

The Coding Stages Assessment: Development and Validation of an Instrument for Assessing Young Children's Proficiency in the Scratchjr Programming Language

Peer reviewed

Direct link

de Ruiter, Laura E.; Bers, Marina U. – Computer Science Education, 2022

Background and Context: Despite the increasing implementation of coding in early curricula, there are few valid and reliable assessments of coding abilities for young children. This impedes studying learning outcomes and the development and evaluation of curricula. Objective: Developing and validating a new instrument for assessing young…

Descriptors: Programming Languages, Computer Software, Coding, Computer Science Education

The Specific Academic Learning Self-Efficacy and the Specific Academic Exam Self-Efficacy Scales: Construct and Criterion Validity Revisited Using Rasch Models

Peer reviewed

Direct link

Nielsen, Tine – Cogent Education, 2020

Academic self-efficacy is mostly construed as specific; task-specific, course-specific or domain-specific. Previous research in the Danish university context has shown that the self-efficacy subscale in the Motivated Strategies for Leaning Questionnaire is not a single scale, but consists of two separate course- and activity-specific scales; the…

Descriptors: Academic Achievement, Self Efficacy, Test Wiseness, Construct Validity

System Competence Modelling: Theoretical Foundation and Empirical Validation of a Model Involving Natural, Social and Human-Environment Systems

Peer reviewed

Direct link

Mehren, Rainer; Rempfler, Armin; Buchholz, Janine; Hartig, Johannes; Ulrich-Riedhammer, Eva M. – Journal of Research in Science Teaching, 2018

Constituting a metacognitive strategy, system competence or systems thinking can only assume its assigned key function as a basic concept for the school subject of geography in Germany after a theoretical and empirical foundation has been established. A measurement instrument is required which is suitable both for supporting students and for the…

Descriptors: Models, Metacognition, Competence, Geography

Evaluating Subscore Uses across Multiple Levels: A Case of Reading and Listening Subscores for Young EFL Learners

Peer reviewed

Direct link

Choi, Ikkyu; Papageorgiou, Spiros – Language Testing, 2020

Stakeholders of language tests are often interested in subscores. However, reporting a subscore is not always justified; a subscore should provide reliable and distinct information to be worth reporting. When a subscore is used for decisions across multiple levels (e.g., individual test takers and schools), it needs to be justified for its…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Scores

Item Response Theory: An Introduction to Latent Trait Models to Test and Item Development

Peer reviewed
PDF on ERIC

Download full text

Bichi, Ado Abdu; Talib, Rohaya – International Journal of Evaluation and Research in Education, 2018

Testing in educational system perform a number of functions, the results from a test can be used to make a number of decisions in education. It is therefore well accepted in the education literature that, testing is an important element of education. To effectively utilize the tests in educational policies and quality assurance its validity and…

Descriptors: Item Response Theory, Test Items, Test Construction, Decision Making

A Comparison of Reliability and Precision of Subscore Reporting Methods for a State English Language Proficiency Assessment

Peer reviewed

Direct link

Longabach, Tanya; Peyton, Vicki – Language Testing, 2018

K-12 English language proficiency tests that assess multiple content domains (e.g., listening, speaking, reading, writing) often have subsections based on these content domains; scores assigned to these subsections are commonly known as subscores. Testing programs face increasing customer demands for the reporting of subscores in addition to the…

Descriptors: Comparative Analysis, Test Reliability, Second Language Learning, Language Proficiency

Test Assembly Implications for Providing Reliable and Valid Subscores

Peer reviewed

Direct link

Lee, Minji K.; Sweeney, Kevin; Melican, Gerald J. – Educational Assessment, 2017

This study investigates the relationships among factor correlations, inter-item correlations, and the reliability estimates of subscores, providing a guideline with respect to psychometric properties of useful subscores. In addition, it compares subscore estimation methods with respect to reliability and distinctness. The subscore estimation…

Descriptors: Scores, Test Construction, Test Reliability, Test Validity

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

Cogent Education	4
Educational and Psychological…	4
ProQuest LLC	4
Language Testing	3
Applied Psychological…	1
Assessment	1
CBE - Life Sciences Education	1
Career Development and…	1
College Board	1
Computer Science Education	1
Educational Assessment	1
Educational Technology…	1
Electronic Journal of…	1
Eurasian Journal of…	1
European Journal of…	1
Florida Center for Reading…	1
Gerontologist	1
Higher Education Research and…	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
Journal of Counseling…	1
Journal of Education and…	1
Journal of Educational…	1
Journal of Interactive Online…	1
More ▼

Wang, Wen-Chung	2
Abdullah Alamer	1
Abdulrahman Alshabeb	1
Ahmed Al Khateeb	1
Angelica Garzon Umerenkova	1
Baghaei, Purya	1
Baghi, Heibatollah	1
Barno S. Abdullaeva	1
Beachy, Rachel Rayburn	1
Beaujean, A. Alexander	1
Bernstein, Ira H.	1
Bers, Marina U.	1
Bichi, Ado Abdu	1
Buchholz, Janine	1
Bulut, Okan	1
Carmody, Thomas J.	1
Caroline M. Böhm	1
Carvajal, Jorge	1
Castle, Courtney	1
Chatterji, Somnath	1
Choi, Ikkyu	1
Church, A. Timothy	1
Cole, Russell	1
Couch, Brian A.	1
Culligan, Brent	1
More ▼