ERIC - Search Results

Showing all 15 results

A Psychometric Framework for Evaluating Fairness in Algorithmic Decision Making: Differential Algorithmic Functioning

Peer reviewed

Youmi Suk; Kyung T. Han – Journal of Educational and Behavioral Statistics, 2024

As algorithmic decision making is increasingly deployed in every walk of life, many researchers have raised concerns about fairness-related bias from such algorithms. But there is little research on harnessing psychometric methods to uncover potential discriminatory bias inside decision-making algorithms. The main goal of this article is to…

Descriptors: Psychometrics, Ethics, Decision Making, Algorithms

Item Pool Quality Control in Educational Testing: Change Point Model, Compound Risk, and Sequential Detection

Peer reviewed

Chen, Yunxiao; Lee, Yi-Hsuan; Li, Xiaoou – Journal of Educational and Behavioral Statistics, 2022

In standardized educational testing, test items are reused in multiple test administrations. To ensure the validity of test scores, the psychometric properties of items should remain unchanged over time. In this article, we consider the sequential monitoring of test items, in particular, the detection of abrupt changes to their psychometric…

Descriptors: Standardized Tests, Test Items, Test Validity, Scores

The Importance of Increased Processing Demands in the Design of Elicited Imitation Tests

Peer reviewed

Rosemary Erlam; Lan Wei – Language Teaching Research, 2024

This study is a conceptual replication of Ellis' 'Measuring implicit and explicit knowledge of a second language: A psychometric study', published in "Studies in Second Language Acquisition" (2005), aiming to establish the importance of including belief statements (hypothesized to increase processing demands) in the design of Elicited…

Descriptors: Language Processing, Language Tests, Second Language Learning, Psychometrics

Exploring Confidence Accuracy and Item Difficulty in Changing Multiple-Choice Answers of Scientific Reasoning Test

Peer reviewed

Fadillah, Sarah Meilani; Ha, Minsu; Nuraeni, Eni; Indriyanti, Nurma Yunita – Malaysian Journal of Learning and Instruction, 2023

Purpose: Researchers discovered that when students were given the opportunity to change their answers, a majority changed their responses from incorrect to correct, and this change often increased the overall test results. What prompts students to modify their answers? This study aims to examine the modification of scientific reasoning test, with…

Descriptors: Science Tests, Multiple Choice Tests, Test Items, Decision Making

Developing a Whole Child School Screening Instrument: Evaluating Perceived Usability as an Initial Step in Planning for Consequential Validity

Peer reviewed

Jessica B. Koslouski; Sandra M. Chafouleas; Amy Briesch; Jacqueline M. Caemmerer; Brittany Melo – School Mental Health, 2024

We are developing the Equitable Screening to Support Youth (ESSY) Whole Child Screener to address concerns prevalent in existing school-based screenings that impede goals to advance educational equity using universal screeners. Traditional assessment development does not include end users in the early development phases, instead relying on a…

Descriptors: Screening Tests, Psychometrics, Validity, Child Development

Developing a Whole Child School Screening Instrument: Evaluating Perceived Usability as an Initial Step in Planning for Consequential Validity

Peer reviewed

Jessica B. Koslouski; Sandra M. Chafouleas; Amy Briesch; Jacqueline M. Caemmerer; Brittany Melo – Grantee Submission, 2024

Descriptors: Screening Tests, Usability, Decision Making, Validity

A Methodology to Validate Foreign Language Teaching Effectiveness Self-Assessment: A Case of the STARTALK-CHELER Teacher Program Questionnaire

Shujuan Wang – ProQuest LLC, 2021

Existing methods used to validate self-report questionnaires in foreign language teaching effectiveness have relied on Classical Test Theory (CTT). However, the use of CTT approaches limits the reliability and validity of self-report instruments. The Rasch Model, which is based on the principles of objective measurement, addresses some of the…

Descriptors: Second Language Programs, Second Language Learning, Second Language Instruction, Language Tests

Development and Empirical Evaluation of Indecisiveness Scale for Adolescent Students

Peer reviewed

Nawaz, Sehrish; Naveed Riaz, Muhammad; Yasmin, Humaira; Akram Riaz, Muhammad; Batool, Naila – Bulletin of Education and Research, 2017

This study was conducted to develop and empirically evaluate a valid and reliable indigenous self-report measure of indecisiveness. The sample consisted of 300 students. The items were constructed on the basis of previous literature and information received from focus groups. The whole item pool of the Indecisiveness Scale was subjected to principal…

Descriptors: Foreign Countries, Test Construction, Test Items, Test Validity

Individual Growth and Development Indicators-Español: Innovation in the Development of Spanish Oral Language General Outcome Measures

Durán, Lillian K.; Wackerle-Hollman, Alisha K.; Kohlmeier, Theresa L.; Brunner, Stephanie K.; Palma, Jose; Callard, Chase H. – Grantee Submission, 2019

The population of Spanish-speaking preschoolers in the United States continues to increase and there is a significant need to develop psychometrically sound early language and literacy screening measures to accurately capture children's ability in Spanish. In this paper, we describe the innovative design and calibration process of the new…

Descriptors: Spanish Speaking, Preschool Children, Psychometrics, Screening Tests

Using a Model of Analysts' Judgments to Augment an Item Calibration Process

Peer reviewed

Hauser, Carl; Thum, Yeow Meng; He, Wei; Ma, Lingling – Educational and Psychological Measurement, 2015

When conducting item reviews, analysts evaluate an array of statistical and graphical information to assess the fit of a field test (FT) item to an item response theory model. The process can be tedious, particularly when the number of human reviews (HR) to be completed is large. Furthermore, such a process leads to decisions that are susceptible…

Descriptors: Test Items, Item Response Theory, Research Methodology, Decision Making

Cognitive Tests in Early Childhood: Psychometric and Cultural Considerations

Peer reviewed

Williams, Marian E.; Sando, Lara; Soles, Tamara Glen – Journal of Psychoeducational Assessment, 2014

Cognitive assessment of young children contributes to high-stakes decisions because results are often used to determine eligibility for early intervention and special education. Previous reviews of cognitive measures for young children highlighted concerns regarding adequacy of standardization samples, steep item gradients, and insufficient floors…

Descriptors: Intelligence Tests, Decision Making, High Stakes Tests, Eligibility

Assessing the Test Information Function and Differential Item Functioning for the "TOEFL Junior"® Standard Test. Research Report. ETS RR-13-17. "TOEFL Junior"® Research Report. TOEFL JR-01

Peer reviewed

Young, John W.; Morgan, Rick; Rybinski, Paul; Steinberg, Jonathan; Wang, Yuan – ETS Research Report Series, 2013

The "TOEFL Junior"® Standard Test is an assessment that measures the degree to which middle school-aged students learning English as a second language have attained proficiency in the academic and social English skills representative of English-medium instructional environments. The assessment measures skills in three areas: listening…

Descriptors: Item Response Theory, Test Items, Language Tests, Second Language Learning

The Effect of Review on the Psychometric Characteristics of Computerized Adaptive Tests.

Peer reviewed

Stone, Gregory Ethan; Lunz, Mary E. – Applied Measurement in Education, 1994

Effects of reviewing items and altering responses on examinee ability estimates, test precision, test information, decision confidence, and pass/fail status were studied for 376 examinees taking 2 certification tests. Test precision is only slightly affected by review, and average information loss can be recovered by addition of one item. (SLD)

Descriptors: Ability, Adaptive Testing, Certification, Change

A Look at Psychometrics in the Netherlands.

Hambleton, Ronald K.; Swaminathan, H. – 1985

Comments are made on the review papers presented by six Dutch psychometricians: Ivo Molenaar, Wim van der Linden, Ed Roskam, Arnold Van den Wollenberg, Gideon Mellenbergh, and Dato de Gruijter. Molenaar has embraced a pragmatic viewpoint on Bayesian methods, using both empirical and pure approaches to solve educational research problems. Molenaar…

Descriptors: Bayesian Statistics, Decision Making, Elementary Secondary Education, Foreign Countries

Social and Technical Issues in Testing: Implications for Test Construction and Usage. Buros-Nebraska Symposium on Measurement and Testing (1st, Lincoln, Nebraska, 1983). Volume 1.

Plake, Barbara S., Ed. – 1984

An introduction by Barbara S. Plake emphasizes that this volume investigates social and technical influences on test development and usage. Essential preliminary information on how tests can be used and may be interpreted is presented. Under the heading "Social and Technical Influences" are: (1) "Struggles and Possibilities: The Use…

Descriptors: Academic Achievement, Achievement Tests, Aptitude Tests, Cognitive Psychology
