ERIC - Search Results

Showing 1 to 15 of 153 results

Reconceptualization of Coefficient Alpha Reliability for Test Summed and Scaled Scores

Peer reviewed

Almehrizi, Rashid S. – Educational Measurement: Issues and Practice, 2022

Coefficient alpha reliability persists as the most common reliability coefficient reported in research. The assumptions for its use are, however, not well-understood. The current paper challenges the commonly used expressions of coefficient alpha and argues that while these expressions are correct when estimating reliability for summed scores,…

Descriptors: Reliability, Scores, Scaling, Statistical Analysis

On the Generalized S-X²-Test of Item Fit: Some Variants, Residuals, and a Graphical Visualization

Peer reviewed

Ranger, Jochen; Brauer, Kay – Journal of Educational and Behavioral Statistics, 2022

The generalized S-X²-test is a test of item fit for items with a polytomous response format. The test is based on a comparison of the observed and expected number of responses in strata defined by the test score. In this article, we make four contributions. We demonstrate that the performance of the generalized S-X²-test…

Descriptors: Goodness of Fit, Test Items, Statistical Analysis, Item Response Theory

Detecting Item Preknowledge Using Revisits with Speed and Accuracy

Peer reviewed

Demirkaya, Onur; Bezirhan, Ummugul; Zhang, Jinming – Journal of Educational and Behavioral Statistics, 2023

Examinees with item preknowledge tend to obtain inflated test scores that undermine test score validity. With the availability of process data collected in computer-based assessments, the research on detecting item preknowledge has progressed on using both item scores and response times. Item revisit patterns of examinees can also be utilized as…

Descriptors: Test Items, Prior Learning, Knowledge Level, Reaction Time

An Introduction to Statistical Techniques Used for Detecting Anomaly in Test Results

Peer reviewed

He, Qingping; Meadows, Michelle; Black, Beth – Research Papers in Education, 2022

A potential negative consequence of high-stakes testing is inappropriate test behaviour involving individuals and/or institutions. Inappropriate test behaviour and test collusion can result in aberrant response patterns and anomalous test scores and invalidate the intended interpretation and use of test results. A variety of statistical techniques…

Descriptors: Statistical Analysis, High Stakes Tests, Scores, Response Style (Tests)

Score Comparability Issues with At-Home Testing and How to Address Them

Peer reviewed

Puhan, Gautam; Kim, Sooyeon – Journal of Educational Measurement, 2022

As a result of the COVID-19 pandemic, at-home testing has become a popular delivery mode in many testing programs. When programs offer at-home testing to expand their service, the score comparability between test takers testing remotely and those testing in a test center is critical. This article summarizes statistical procedures that could be…

Descriptors: Scores, Scoring, Comparative Analysis, Testing

Somers' D as an Alternative for the Item-Test and Item-Rest Correlation Coefficients in the Educational Measurement Settings

Peer reviewed
PDF on ERIC

Metsämuuronen, Jari – International Journal of Educational Methodology, 2020

Pearson product-moment correlation coefficient between item g and test score X, known as item-test or item-total correlation ("Rit"), and item-rest correlation ("Rir") are two of the most used classical estimators for item discrimination power (IDP). Both "Rit" and "Rir" underestimate IDP caused by the…

Descriptors: Correlation, Test Items, Scores, Difficulty Level

Improvement of Norm Score Quality via Regression-Based Continuous Norming

Peer reviewed

Lenhard, Wolfgang; Lenhard, Alexandra – Educational and Psychological Measurement, 2021

The interpretation of psychometric test results is usually based on norm scores. We compared semiparametric continuous norming (SPCN) with conventional norming methods by simulating results for test scales with different item numbers and difficulties via an item response theory approach. Subsequently, we modeled the norm scores based on random…

Descriptors: Test Norms, Scores, Regression (Statistics), Test Items

On the Statistical and Practical Limitations of Thurstonian IRT Models

Peer reviewed

Bürkner, Paul-Christian; Schulte, Niklas; Holling, Heinz – Educational and Psychological Measurement, 2019

Forced-choice questionnaires have been proposed to avoid common response biases typically associated with rating scale questionnaires. To overcome ipsativity issues of trait scores obtained from classical scoring approaches of forced-choice items, advanced methods from item response theory (IRT) such as the Thurstonian IRT model have been…

Descriptors: Item Response Theory, Measurement Techniques, Questionnaires, Rating Scales

The Use of Item Scores and Response Times to Detect Examinees Who May Have Benefited from Item Preknowledge

Peer reviewed
PDF on ERIC

Sinharay, Sandip; Johnson, Matthew S. – Grantee Submission, 2019

According to Wollack and Schoenig (2018), benefitting from item preknowledge is one of the three broad types of test fraud that occur in educational assessments. We use tools from constrained statistical inference to suggest a new statistic that is based on item scores and response times and can be used to detect the examinees who may have…

Descriptors: Scores, Test Items, Reaction Time, Cheating

Research on Psychometric Modeling, Analysis, and Reporting of the National Assessment of Educational Progress

Peer reviewed
PDF on ERIC

Oranje, Andreas; Kolstad, Andrew – Journal of Educational and Behavioral Statistics, 2019

The design and psychometric methodology of the National Assessment of Educational Progress (NAEP) is constantly evolving to meet the changing interests and demands stemming from a rapidly shifting educational landscape. NAEP has been built on strong research foundations that include conducting extensive evaluations and comparisons before new…

Descriptors: National Competency Tests, Psychometrics, Statistical Analysis, Computation

A Shorter Short Version of Barron's Ego Strength Scale

Peer reviewed

Kelly, William E.; Daughtry, Don – College Student Journal, 2018

This study developed an abbreviated form of Barron's (1953) Ego Strength Scale for use in research among college student samples. A version of Barron's scale was administered to 100 undergraduate college students. Using item-total score correlations and internal consistency, the scale was reduced to 18 items (Es18). The Es18 possessed adequate…

Descriptors: Undergraduate Students, Self Concept Measures, Test Length, Scores

Examining Power and Type 1 Error for Step and Item Level Tests of Invariance: Investigating the Effect of the Number of Item Score Levels

Ayodele, Alicia Nicole – ProQuest LLC, 2017

Within polytomous items, differential item functioning (DIF) can take on various forms due to the number of response categories. The lack of invariance at this level is referred to as differential step functioning (DSF). The most common DSF methods in the literature are the adjacent category log odds ratio (AC-LOR) estimator and cumulative…

Descriptors: Statistical Analysis, Test Bias, Test Items, Scores

Large Sample Confidence Intervals for Item Response Theory Reliability Coefficients

Peer reviewed

Andersson, Björn; Xin, Tao – Educational and Psychological Measurement, 2018

In applications of item response theory (IRT), an estimate of the reliability of the ability estimates or sum scores is often reported. However, analytical expressions for the standard errors of the estimators of the reliability coefficients are not available in the literature and therefore the variability associated with the estimated reliability…

Descriptors: Item Response Theory, Test Reliability, Test Items, Scores

Profile Analyses as Feedback by Evaluating the Balance in Exam Scores

Peer reviewed
PDF on ERIC

Vaheoja, Monika; Verhelst, N. D.; Eggen, T.J.H.M. – European Journal of Science and Mathematics Education, 2019

In this article, the authors applied profile analysis to Maths exam data to demonstrate how different exam forms, differing in difficulty and length, can be reported and easily interpreted. The results were presented for different groups of participants and for different institutions in different Maths domains by evaluating the balance. Some…

Descriptors: Feedback (Response), Foreign Countries, Statistical Analysis, Scores

Differential Item Functioning for Accommodated Students with Disabilities: Effect of Differences in Proficiency Distributions

Peer reviewed

Quesen, Sarah; Lane, Suzanne – Applied Measurement in Education, 2019

This study examined the effect of similar vs. dissimilar proficiency distributions on uniform DIF detection on a statewide eighth grade mathematics assessment. Results from the similar- and dissimilar-ability reference groups with an SWD focal group were compared for four models: logistic regression, hierarchical generalized linear model (HGLM),…

Descriptors: Test Items, Mathematics Tests, Grade 8, Item Response Theory
