NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Researchers2
Laws, Policies, & Programs
Assessments and Surveys
Test of English as a Foreign…1
What Works Clearinghouse Rating
Showing all 15 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Youmi Suk; Kyung T. Han – Journal of Educational and Behavioral Statistics, 2024
As algorithmic decision making is increasingly deployed in every walk of life, many researchers have raised concerns about fairness-related bias from such algorithms. But there is little research on harnessing psychometric methods to uncover potential discriminatory bias inside decision-making algorithms. The main goal of this article is to…
Descriptors: Psychometrics, Ethics, Decision Making, Algorithms
Peer reviewed Peer reviewed
Direct linkDirect link
Chen, Yunxiao; Lee, Yi-Hsuan; Li, Xiaoou – Journal of Educational and Behavioral Statistics, 2022
In standardized educational testing, test items are reused in multiple test administrations. To ensure the validity of test scores, the psychometric properties of items should remain unchanged over time. In this article, we consider the sequential monitoring of test items, in particular, the detection of abrupt changes to their psychometric…
Descriptors: Standardized Tests, Test Items, Test Validity, Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Rosemary Erlam; Lan Wei – Language Teaching Research, 2024
This study is a conceptual replication of Ellis' 'Measuring implicit and explicit knowledge of a second language: A psychometric study', published in "Studies in Second Language Acquisition" (2005), aiming to establish the importance of including belief statements (hypothesized to increase processing demands) in the design of Elicited…
Descriptors: Language Processing, Language Tests, Second Language Learning, Psychometrics
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Fadillah, Sarah Meilani; Ha, Minsu; Nuraeni, Eni; Indriyanti, Nurma Yunita – Malaysian Journal of Learning and Instruction, 2023
Purpose: Researchers discovered that when students were given the opportunity to change their answers, a majority changed their responses from incorrect to correct, and this change often increased the overall test results. What prompts students to modify their answers? This study aims to examine the modification of scientific reasoning test, with…
Descriptors: Science Tests, Multiple Choice Tests, Test Items, Decision Making
Peer reviewed Peer reviewed
Direct linkDirect link
Jessica B. Koslouski; Sandra M. Chafouleas; Amy Briesch; Jacqueline M. Caemmerer; Brittany Melo – School Mental Health, 2024
We are developing the Equitable Screening to Support Youth (ESSY) Whole Child Screener to address concerns prevalent in existing school-based screenings that impede goals to advance educational equity using universal screeners. Traditional assessment development does not include end users in the early development phases, instead relying on a…
Descriptors: Screening Tests, Psychometrics, Validity, Child Development
Peer reviewed Peer reviewed
Direct linkDirect link
Jessica B. Koslouski; Sandra M. Chafouleas; Amy Briesch; Jacqueline M. Caemmerer; Brittany Melo – Grantee Submission, 2024
We are developing the Equitable Screening to Support Youth (ESSY) Whole Child Screener to address concerns prevalent in existing school-based screenings that impede goals to advance educational equity using universal screeners. Traditional assessment development does not include end users in the early development phases, instead relying on a…
Descriptors: Screening Tests, Usability, Decision Making, Validity
Shujuan Wang – ProQuest LLC, 2021
Existing methods used to validate self-report questionnaires in foreign language teaching effectiveness have relied on Classical Test Theory (CTT). However, the use of CTT approaches limits the reliability and validity of self-report instruments. The Rasch Model, which is based on the principles of objective measurement, addresses some of the…
Descriptors: Second Language Programs, Second Language Learning, Second Language Instruction, Language Tests
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Nawaz, Sehrish; Naveed Riaz, Muhammad; Yasmin, Humaira; Akram Riaz, Muhammad; Batool, Naila – Bulletin of Education and Research, 2017
This study was conducted to develop a valid and reliable indigenous self-report measure of indecisiveness and its empirical evaluation. Sample was consisted of 300 students. The items were constructed on the bases of previous literature and information received by focus groups. The whole Item pool of Indecisiveness Scale was subjected to principal…
Descriptors: Foreign Countries, Test Construction, Test Items, Test Validity
Durán, Lillian K.; Wackerle-Hollman, Alisha K.; Kohlmeier, Theresa L.; Brunner, Stephanie K.; Palma, Jose; Callard, Chase H. – Grantee Submission, 2019
The population of Spanish-speaking preschoolers in the United States continues to increase and there is a significant need to develop psychometrically sound early language and literacy screening measures to accurately capture children's ability in Spanish. In this paper, we describe the innovative design and calibration process of the new…
Descriptors: Spanish Speaking, Preschool Children, Psychometrics, Screening Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Hauser, Carl; Thum, Yeow Meng; He, Wei; Ma, Lingling – Educational and Psychological Measurement, 2015
When conducting item reviews, analysts evaluate an array of statistical and graphical information to assess the fit of a field test (FT) item to an item response theory model. The process can be tedious, particularly when the number of human reviews (HR) to be completed is large. Furthermore, such a process leads to decisions that are susceptible…
Descriptors: Test Items, Item Response Theory, Research Methodology, Decision Making
Peer reviewed Peer reviewed
Direct linkDirect link
Williams, Marian E.; Sando, Lara; Soles, Tamara Glen – Journal of Psychoeducational Assessment, 2014
Cognitive assessment of young children contributes to high-stakes decisions because results are often used to determine eligibility for early intervention and special education. Previous reviews of cognitive measures for young children highlighted concerns regarding adequacy of standardization samples, steep item gradients, and insufficient floors…
Descriptors: Intelligence Tests, Decision Making, High Stakes Tests, Eligibility
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Young, John W.; Morgan, Rick; Rybinski, Paul; Steinberg, Jonathan; Wang, Yuan – ETS Research Report Series, 2013
The "TOEFL Junior"® Standard Test is an assessment that measures the degree to which middle school-aged students learning English as a second language have attained proficiency in the academic and social English skills representative of English-medium instructional environments. The assessment measures skills in three areas: listening…
Descriptors: Item Response Theory, Test Items, Language Tests, Second Language Learning
Peer reviewed Peer reviewed
Stone, Gregory Ethan; Lunz, Mary E. – Applied Measurement in Education, 1994
Effects of reviewing items and altering responses on examinee ability estimates, test precision, test information, decision confidence, and pass/fail status were studied for 376 examinees taking 2 certification tests. Test precision is only slightly affected by review, and average information loss can be recovered by addition of one item. (SLD)
Descriptors: Ability, Adaptive Testing, Certification, Change
Hambleton, Ronald K.; Swaminathan, H. – 1985
Comments are made on the review papers presented by six Dutch psychometricians: Ivo Molenaar, Wim van der Linden, Ed Roskam, Arnold Van den Wollenberg, Gideon Mellenbergh, and Dato de Gruijter. Molenaar has embraced a pragmatic viewpoint on Bayesian methods, using both empirical and pure approaches to solve educational research problems. Molenaar…
Descriptors: Bayesian Statistics, Decision Making, Elementary Secondary Education, Foreign Countries
Plake, Barbara S., Ed. – 1984
An introduction by Barbara S. Plake emphasizes that this volume investigates social and technical influences on test development and usage. Essential preliminary information on how tests can be used and may be interpreted is presented. Under the heading "Social and Technical Influences" are: (1) "Struggles and Possibilities: The Use…
Descriptors: Academic Achievement, Achievement Tests, Aptitude Tests, Cognitive Psychology