Showing 1 to 15 of 47 results
Peer reviewed
PDF on ERIC
Sayin, Ayfer; Sata, Mehmet – International Journal of Assessment Tools in Education, 2022
The aim of the present study was to examine Turkish teacher candidates' competency levels in writing different types of test items by utilizing Rasch analysis. In addition, the effect of the expertise of the raters scoring the items written by the teacher candidates was examined within the scope of the study. 84 Turkish teacher candidates…
Descriptors: Foreign Countries, Item Response Theory, Evaluators, Expertise
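As background for the Rasch analysis named in the entry above: the dichotomous Rasch model expresses the probability that person n answers item i correctly as a function of person ability \(\theta_n\) and item difficulty \(b_i\). When rater effects are also modeled, as in studies of rater expertise, a many-facet extension adds a rater severity term \(c_j\). A sketch of both forms (the notation here is standard, not taken from the study itself):

```latex
% Dichotomous Rasch model
P(X_{ni} = 1 \mid \theta_n, b_i) = \frac{\exp(\theta_n - b_i)}{1 + \exp(\theta_n - b_i)}

% Many-facet extension with rater severity c_j
P(X_{nij} = 1 \mid \theta_n, b_i, c_j) = \frac{\exp(\theta_n - b_i - c_j)}{1 + \exp(\theta_n - b_i - c_j)}
```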
Peer reviewed
Direct link
Lee, Won-Chan; Kim, Stella Y.; Choi, Jiwon; Kang, Yujin – Journal of Educational Measurement, 2020
This article considers psychometric properties of composite raw scores and transformed scale scores on mixed-format tests that consist of a mixture of multiple-choice and free-response items. Test scores on several mixed-format tests are evaluated with respect to conditional and overall standard errors of measurement, score reliability, and…
Descriptors: Raw Scores, Item Response Theory, Test Format, Multiple Choice Tests
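The standard error of measurement (SEM) evaluated in the entry above relates score variability to reliability. The overall SEM follows from the classical test theory identity below; a conditional SEM instead estimates this quantity separately at each score level, which is what makes it informative for mixed-format tests (general formula, not specific to the study):

```latex
\mathrm{SEM} = \sigma_X \sqrt{1 - \rho_{XX'}}
```

where \(\sigma_X\) is the standard deviation of observed scores and \(\rho_{XX'}\) is the score reliability.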
Peer reviewed
Direct link
Kim, Stella Y.; Lee, Won-Chan – Applied Measurement in Education, 2019
This study explores classification consistency and accuracy for mixed-format tests using real and simulated data. In particular, the current study compares six methods of estimating classification consistency and accuracy for seven mixed-format tests. The relative performance of the estimation methods is evaluated using simulated data. Study…
Descriptors: Classification, Reliability, Accuracy, Test Format
Tingir, Seyfullah – ProQuest LLC, 2019
Educators use various statistical techniques to explain relationships between latent and observable variables. One way to model these relationships is to use Bayesian networks as a scoring model. However, adjusting the conditional probability tables (CPT-parameters) to fit a set of observations is still a challenge when using Bayesian networks. A…
Descriptors: Bayesian Statistics, Statistical Analysis, Scoring, Probability
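To illustrate the kind of scoring model the entry above describes: a minimal Bayesian-network score update uses a conditional probability table (CPT) to revise belief in a latent "mastery" state after observing one item response. The prior and CPT values below are hypothetical, and this single-node sketch is far simpler than the networks such studies calibrate; it only shows the Bayes-rule mechanics.

```python
# Prior belief over the latent variable (hypothetical values)
prior = {"master": 0.5, "non_master": 0.5}

# CPT: P(correct response | latent state). These are the
# parameters that calibration procedures adjust to fit data.
cpt_correct = {"master": 0.85, "non_master": 0.25}

def posterior_given_correct(prior, cpt):
    """Return P(state | correct response) via Bayes' rule."""
    joint = {s: prior[s] * cpt[s] for s in prior}   # P(state, correct)
    evidence = sum(joint.values())                  # P(correct)
    return {s: joint[s] / evidence for s in joint}

post = posterior_given_correct(prior, cpt_correct)
# Observing a correct response shifts belief toward mastery.
```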
Peer reviewed
Direct link
Hao, Tao; Wang, Zhe; Ardasheva, Yuliya – Journal of Research on Educational Effectiveness, 2021
This meta-analysis reviewed research between 2012 and 2018 focused on technology-assisted second language (L2) vocabulary learning for English as a foreign language (EFL) learners. A total of 45 studies of 2,374 preschool-to-college EFL students contributed effect sizes to this meta-analysis. Compared with traditional instructional methods, the…
Descriptors: Vocabulary Development, Second Language Learning, Second Language Instruction, English (Second Language)
Peer reviewed
Direct link
Kim, Sooyeon; Moses, Tim – International Journal of Testing, 2013
The major purpose of this study is to assess the conditions under which single scoring for constructed-response (CR) items is as effective as double scoring in the licensure testing context. We used both empirical datasets of five mixed-format licensure tests collected in actual operational settings and simulated datasets that allowed for the…
Descriptors: Scoring, Test Format, Licensing Examinations (Professions), Test Items
Peer reviewed
Direct link
Yarnell, Jordy B.; Pfeiffer, Steven I. – Journal of Psychoeducational Assessment, 2015
The present study examined the psychometric equivalence of administering a computer-based version of the Gifted Rating Scale (GRS) compared with the traditional paper-and-pencil GRS-School Form (GRS-S). The GRS-S is a teacher-completed rating scale used in gifted assessment. The GRS-Electronic Form provides an alternative method of administering…
Descriptors: Gifted, Psychometrics, Rating Scales, Computer Assisted Testing
Peer reviewed
PDF on ERIC
Moses, Tim – ETS Research Report Series, 2013
The purpose of this report is to review ETS psychometric contributions that focus on test scores. Two major sections review contributions based on assessing test scores' measurement characteristics and other contributions about using test scores as predictors in correlational and regression relationships. An additional section reviews additional…
Descriptors: Psychometrics, Scores, Correlation, Regression (Statistics)
Peer reviewed
Direct link
Irwin, Brian; Hepplestone, Stuart – Assessment & Evaluation in Higher Education, 2012
There have been calls in the literature for changes to assessment practices in higher education, to increase flexibility and give learners more control over the assessment process. This article explores the possibilities of allowing student choice in the format used to present their work, as a starting point for changing assessment, based on…
Descriptors: Student Evaluation, College Students, Selection, Computer Assisted Testing
Peer reviewed
Direct link
Hoffman, Lesa; Templin, Jonathan; Rice, Mabel L. – Journal of Speech, Language, and Hearing Research, 2012
Purpose: The present work describes how vocabulary ability as assessed by 3 different forms of the Peabody Picture Vocabulary Test (PPVT; Dunn & Dunn, 1997) can be placed on a common latent metric through item response theory (IRT) modeling, by which valid comparisons of ability between samples or over time can then be made. Method: Responses…
Descriptors: Item Response Theory, Test Format, Vocabulary, Comparative Analysis
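The common-metric IRT linking described in the entry above typically rests on the indeterminacy of the latent scale: under a two-parameter logistic (2PL) model, a linear transformation of \(\theta\) leaves response probabilities unchanged provided the item parameters are transformed correspondingly. A sketch of the standard relations (general IRT linking formulas, not parameters from the study):

```latex
% 2PL model
P(X_i = 1 \mid \theta) = \frac{1}{1 + \exp\!\left[-a_i(\theta - b_i)\right]}

% Linear scale transformation theta* = A*theta + B preserves P if
a_i^{*} = a_i / A, \qquad b_i^{*} = A\, b_i + B
```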
Peer reviewed
Direct link
Weatherly, Jeffrey N.; Derenne, Adam; Terrell, Heather K. – Psychological Record, 2011
Several measures of delay discounting have been shown to be reliable over periods of up to 3 months. In the present study, 115 participants completed a fill-in-the-blank (FITB) delay-discounting task on sets of 5 different commodities, 12 weeks apart. Results showed that discounting rates were not well described by a hyperbolic function but were…
Descriptors: Delay of Gratification, Reliability, Test Format, Measures (Individuals)
Peer reviewed
Direct link
Lee, HyeSun; Winke, Paula – Language Testing, 2013
We adapted three practice College Scholastic Ability Tests (CSAT) of English listening, each with five-option items, to create four- and three-option versions by asking 73 Korean speakers or learners of English to eliminate the least plausible options in two rounds. Two hundred and sixty-four Korean high school English-language learners formed…
Descriptors: Academic Ability, Stakeholders, Reliability, Listening Comprehension Tests
Peer reviewed
Direct link
Kolen, Michael J.; Lee, Won-Chan – Educational Measurement: Issues and Practice, 2011
This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…
Descriptors: Test Use, Test Format, Error of Measurement, Raw Scores
Peer reviewed
Direct link
Steinmetz, Jean-Paul; Brunner, Martin; Loarer, Even; Houssemand, Claude – Psychological Assessment, 2010
The Wisconsin Card Sorting Test (WCST) assesses executive and frontal lobe function and can be administered manually or by computer. Despite the widespread application of the 2 versions, the psychometric equivalence of their scores has rarely been evaluated and only a limited set of criteria has been considered. The present experimental study (N =…
Descriptors: Computer Assisted Testing, Psychometrics, Test Theory, Scores
Peer reviewed
Direct link
Vassar, Matt – Social Indicators Research, 2008
The purpose of the present study was to meta-analytically investigate the score reliability for the Satisfaction With Life Scale. Four-hundred and sixteen articles using the measure were located through electronic database searches and then separated to identify studies which had calculated reliability estimates from their own data. Sixty-two…
Descriptors: Test Format, Life Satisfaction, Reliability, Measures (Individuals)