ERIC - Search Results

Showing 1 to 15 of 39 results

Score Comparability Issues with At-Home Testing and How to Address Them

Peer reviewed

Puhan, Gautam; Kim, Sooyeon – Journal of Educational Measurement, 2022

As a result of the COVID-19 pandemic, at-home testing has become a popular delivery mode in many testing programs. When programs offer at-home testing to expand their service, the score comparability between test takers testing remotely and those testing in a test center is critical. This article summarizes statistical procedures that could be…

Descriptors: Scores, Scoring, Comparative Analysis, Testing

Evaluating the Effectiveness of the Expectation-Maximization (EM) Algorithm for Bayesian Network Calibration

Tingir, Seyfullah – ProQuest LLC, 2019

Educators use various statistical techniques to explain relationships between latent and observable variables. One way to model these relationships is to use Bayesian networks as a scoring model. However, adjusting the conditional probability tables (CPT-parameters) to fit a set of observations is still a challenge when using Bayesian networks. A…

Descriptors: Bayesian Statistics, Statistical Analysis, Scoring, Probability

Statistically Comparing the Performance of Multiple Automated Raters across Multiple Items

Peer reviewed

Kieftenbeld, Vincent; Boyer, Michelle – Applied Measurement in Education, 2017

Automated scoring systems are typically evaluated by comparing the performance of a single automated rater item-by-item to human raters. This presents a challenge when the performance of multiple raters needs to be compared across multiple items. Rankings could depend on specifics of the ranking procedure; observed differences could be due to…

Descriptors: Automation, Scoring, Comparative Analysis, Test Items

Evaluation of Different Scoring Rules for a Noncognitive Test in Development. Research Report. ETS RR-16-03

Peer reviewed

Guo, Hongwen; Zu, Jiyun; Kyllonen, Patrick; Schmitt, Neal – ETS Research Report Series, 2016

In this report, systematic applications of statistical and psychometric methods are used to develop and evaluate scoring rules in terms of test reliability. Data collected from a situational judgment test are used to facilitate the comparison. For a well-developed item with appropriate keys (i.e., the correct answers), agreement among various…

Descriptors: Scoring, Test Reliability, Statistical Analysis, Psychometrics

Item Response Data Analysis Using Stata Item Response Theory Package

Peer reviewed

Yang, Ji Seung; Zheng, Xiaying – Journal of Educational and Behavioral Statistics, 2018

The purpose of this article is to introduce and review the capability and performance of the Stata item response theory (IRT) package that is available from Stata v.14, 2015. Using a simulated data set and a publicly available item response data set extracted from the Programme for International Student Assessment, we review the IRT package from…

Descriptors: Item Response Theory, Item Analysis, Computer Software, Statistical Analysis

Equating without an Anchor for Nonequivalent Groups of Examinees

Peer reviewed

Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2015

An equating procedure for a testing program with evolving distribution of examinee profiles is developed. No anchor is available because the original scoring scheme was based on expert judgment of the item difficulties. Pairs of examinees from two administrations are formed by matching on coarsened propensity scores derived from a set of…

Descriptors: Equated Scores, Testing Programs, College Entrance Examinations, Scoring

As a Potential Source of Error, Measuring the Tendency of University Students to Copy the Answers: A Scale Development Study

Peer reviewed

Demir, Ergul – Eurasian Journal of Educational Research, 2018

Purpose: The answer-copying tendency has the potential to detect suspicious answer patterns for prior distributions of statistical detection techniques. The aim of this study is to develop a valid and reliable measurement tool as a scale in order to observe the tendency of university students' copying of answers. Also, it is aimed to provide…

Descriptors: College Students, Cheating, Test Construction, Student Behavior

Development and Validation of the Written Communication Assessment of the "HEIghten"® Outcomes Assessment Suite. Research Report. ETS RR-17-53

Peer reviewed

Rios, Joseph A.; Sparks, Jesse R.; Zhang, Mo; Liu, Ou Lydia – ETS Research Report Series, 2017

Proficiency with written communication (WC) is critical for success in college and careers. As a result, institutions face a growing challenge to accurately evaluate their students' writing skills to obtain data that can support demands of accreditation, accountability, or curricular improvement. Many current standardized measures, however, lack…

Descriptors: Test Construction, Test Validity, Writing Tests, College Outcomes Assessment

Equating Test Scores (without IRT). Second Edition

Livingston, Samuel A. – Educational Testing Service, 2014

This booklet grew out of a half-day class on equating that author Samuel Livingston teaches for new statistical staff at Educational Testing Service (ETS). The class is a nonmathematical introduction to the topic, emphasizing conceptual understanding and practical applications. The class consists of illustrated lectures, interspersed with…

Descriptors: Equated Scores, Scoring, Self Evaluation (Individuals), Scores

Lexical Difficulty--Using Elicited Imitation to Study Child L2

Peer reviewed

Campfield, Dorota E. – Language Testing, 2017

This paper reports a post-hoc analysis of the influence of lexical difficulty of cue sentences on performance in an elicited imitation (EI) task to assess oral production skills for 645 child L2 English learners in instructional settings. This formed part of a large-scale investigation into effectiveness of foreign language teaching in Polish…

Descriptors: Difficulty Level, Second Language Learning, Second Language Instruction, Elementary School Students

Speed-Accuracy Response Models: Scoring Rules Based on Response Time and Accuracy

Peer reviewed

Maris, Gunter; van der Maas, Han – Psychometrika, 2012

Starting from an explicit scoring rule for time limit tasks incorporating both response time and accuracy, and a definite trade-off between speed and accuracy, a response model is derived. Since the scoring rule is interpreted as a sufficient statistic, the model belongs to the exponential family. The various marginal and conditional distributions…

Descriptors: Item Response Theory, Scoring, Reaction Time, Accuracy

How Accurately Can the Google Web Speech API Recognize and Transcribe Japanese L2 English Learners' Oral Production?

Peer reviewed

Ashwell, Tim; Elam, Jesse R. – JALT CALL Journal, 2017

The ultimate aim of our research project was to use the Google Web Speech API to automate scoring of elicited imitation (EI) tests. However, in order to achieve this goal, we had to take a number of preparatory steps. We needed to assess how accurate this speech recognition tool is in recognizing native speakers' production of the test items; we…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Language Tests

An Item-Driven Adaptive Design for Calibrating Pretest Items. Research Report. ETS RR-14-38

Peer reviewed

Ali, Usama S.; Chang, Hua-Hua – ETS Research Report Series, 2014

Adaptive testing is advantageous in that it provides more efficient ability estimates with fewer items than linear testing does. Item-driven adaptive pretesting may also offer similar advantages, and verification of such a hypothesis about item calibration was the main objective of this study. A suitability index (SI) was introduced to adaptively…

Descriptors: Adaptive Testing, Simulation, Pretests Posttests, Test Items

A Comparison of Item Calibration Procedures in the Presence of Test Speededness

Peer reviewed

Suh, Youngsuk; Cho, Sun-Joo; Wollack, James A. – Journal of Educational Measurement, 2012

In the presence of test speededness, the parameter estimates of item response theory models can be poorly estimated due to conditional dependencies among items, particularly for end-of-test items (i.e., speeded items). This article conducted a systematic comparison of five item calibration procedures--a two-parameter logistic (2PL) model, a…

Descriptors: Response Style (Tests), Timed Tests, Test Items, Item Response Theory

Psychometric Issues in Organizational Stressor Research: A Review and Implications for Sport Psychology

Peer reviewed

Arnold, Rachel; Fletcher, David – Measurement in Physical Education and Exercise Science, 2012

Organizational stressors can potentially elicit a number of undesirable consequences for sport performers. It is, therefore, imperative that psychologists better understand the demands that athletes encounter via their exploration and assessment. However, although researchers have identified a wide range of organizational stressors in competitive…

Descriptors: Athletes, Measures (Individuals), Psychometrics, Stress Variables
