ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	6
Since 2006 (last 20 years)	22

Descriptor

Licensing Examinations…	23
Statistical Analysis	13
Teacher Certification	12
Equated Scores	9
Comparative Analysis	8
Scores	8
Computation	5
Sample Size	5
Test Items	5
Accuracy	4
Elementary School Teachers	4
Correlation	3
Ethnicity	3
Gender Differences	3
Models	3
Multiple Choice Tests	3
Prediction	3
Racial Differences	3
Testing Programs	3
Achievement Gains	2
Bayesian Statistics	2
Error of Measurement	2
High Stakes Tests	2
Item Response Theory	2
Mathematics	2

Source

ETS Research Report Series

Publication Type

Journal Articles	23
Reports - Research	22
Speeches/Meeting Papers	2
Tests/Questionnaires	2
Information Analyses	1
Reports - Evaluative	1

Education Level

Elementary Education	4
Elementary Secondary Education	1
Higher Education	1
Junior High Schools	1
Middle Schools	1
Postsecondary Education	1
Secondary Education	1

Location

New Jersey

Assessments and Surveys

Praxis Series	9
Pre Professional Skills Tests	1
SAT (College Admission Test)	1

Showing 1 to 15 of 23 results

Evaluating Targeted Double Scoring for the Performance Assessment for School Leaders Using Imputation and Decision Theory. Research Report. ETS RR-23-01

Peer reviewed
PDF on ERIC

Miao, Jing; Sinharay, Sandip; Kelbaugh, Chris; Cao, Yi; Wang, Wei – ETS Research Report Series, 2023

In a targeted double-scoring procedure for performance assessments that are used for licensure and certification purposes, a subset of responses receives an independent second rating if the first rating falls into a preidentified critical score range (CSR) where an additional rating would lead to considerably more reliable pass-fail decisions.…

Descriptors: Scoring, Performance Based Assessment, Licensing Examinations (Professions), Certification

The Relationship between "Praxis"® Core Academic Skills for Educators Test and "Praxis"® Subject Assessment Scores: Validity Coefficients and Differential Prediction Analysis by Race/Ethnicity. Research Report. ETS RR-21-23

Peer reviewed
PDF on ERIC

Buzick, Heather – ETS Research Report Series, 2021

The "Praxis"® Core Academic Skills for Educators (Core) tests are used in the teacher preparation program admissions process and as part of initial teacher licensure. The purpose of this study was to estimate the relationship between scores on Praxis Core tests and Praxis Subject Assessments and to test for differential prediction by…

Descriptors: Teacher Certification, Licensing Examinations (Professions), Prediction, Teacher Education Programs

Stakes in Testing: Not a Simple Dichotomy but a Profile of Consequences That Guides Needed Evidence of Measurement Quality. Research Report. ETS RR-19-19

Peer reviewed
PDF on ERIC

Tannenbaum, Richard J.; Kane, Michael T. – ETS Research Report Series, 2019

Testing programs are often classified as high or low stakes to indicate how stringently they need to be evaluated. However, in practice, this classification falls short. A high-stakes label is taken to imply that all indicators of measurement quality must meet high standards; whereas a low-stakes label is taken to imply the opposite. This approach…

Descriptors: High Stakes Tests, Testing Programs, Measurement, Evaluation Criteria

Anchored Graphical Representations: A Graphical Alternative to Traditional Just Qualified Candidate Descriptors for Licensure Tests. Research Report. ETS RR-18-40

Peer reviewed
PDF on ERIC

Kannan, Priya; Tannenbaum, Richard J.; Hebert, Delano – ETS Research Report Series, 2018

A well-constructed just qualified candidate (JQC) description is needed to arrive at a reasonable passing score for licensure tests. Traditionally, such descriptions consist of a list of knowledge and skill statements without sufficient context to internalize its intended meaning, allowing the standard-setting panelists to make idiosyncratic…

Descriptors: Licensing Examinations (Professions), Visual Aids, Standard Setting, Mathematics Teachers

"PRAXIS"® Content Knowledge for Teaching: Initial Reliability and Validity Results for Elementary Reading Language Arts and Mathematics. Research Report. ETS RR-20-15

Peer reviewed
PDF on ERIC

Phelps, Geoffrey; Steinberg, Jonathan; Leusner, Dawn; Minsky, Jennifer; Castellano, Karen; McCulla, Laura – ETS Research Report Series, 2020

The primary purpose of this report is to provide preliminary evidence on the measurement properties for newly designed assessments of content knowledge for teaching (CKT) in elementary reading language arts (RLA) and mathematics. The goal is to offer the CKT tests through the "PRAXIS"® assessment. Additional analyses were conducted to…

Descriptors: Elementary School Teachers, Pedagogical Content Knowledge, Language Arts, Mathematics

Does Retest Effect Impact Test Performance of Repeaters in Different Subgroups? Research Report. ETS RR-20-16

Peer reviewed
PDF on ERIC

Zhou, Jiawen; Cao, Yi – ETS Research Report Series, 2020

In this study, we explored retest effects on test scores and response time for repeaters, examinees who retake an examination. We looked at two groups of repeaters: those who took the same form twice and those who took different forms on their two attempts for a certification and licensure test. Scores improved over the two test attempts, and…

Descriptors: Testing, Test Items, Computer Assisted Testing, Licensing Examinations (Professions)

Evaluation of "e-rater"® for the "Praxis I"® Writing Test. Research Report. ETS RR-15-03

Peer reviewed
PDF on ERIC

Ramineni, Chaitanya; Trapani, Catherine S.; Williamson, David M. – ETS Research Report Series, 2015

Automated scoring models were trained and evaluated for the essay task in the "Praxis I"® writing test. Prompt-specific and generic "e-rater"® scoring models were built, and evaluation statistics, such as quadratic weighted kappa, Pearson correlation, and standardized differences in mean scores, were examined to evaluate the…

Descriptors: Writing Tests, Licensing Examinations (Professions), Teacher Competency Testing, Scoring

Constructed-Response DIF Evaluations for Mixed-Format Tests. Research Report. ETS RR-13-33

Peer reviewed
PDF on ERIC

Moses, Tim; Liu, Jinghua; Tan, Adele; Deng, Weiling; Dorans, Neil J. – ETS Research Report Series, 2013

In this study, differential item functioning (DIF) methods utilizing 14 different matching variables were applied to assess DIF in the constructed-response (CR) items from 6 forms of 3 mixed-format tests. Results suggested that the methods might produce distinct patterns of DIF results for different tests and testing programs, in that the DIF…

Descriptors: Test Construction, Multiple Choice Tests, Test Items, Item Analysis

Theoretical and Empirical Standard Errors for Two Population Invariance Measures in the Linear Equating Case. Research Report. ETS RR-08-24

Peer reviewed
PDF on ERIC

von Davier, Alina A.; Manalo, Jonathan R.; Rijmen, Frank – ETS Research Report Series, 2008

The standard errors of the 2 most widely used population-invariance measures of equating functions, root mean square difference (RMSD) and root expected mean square difference (REMSD), are not derived for common equating methods such as linear equating. Consequently, it is unknown how much noise is contained in these estimates. This paper…

Descriptors: Equated Scores, Error of Measurement, Statistical Analysis, Sampling

Effect of Repeaters on Score Equating in a Large-Scale Licensure Test. Research Report. ETS RR-09-27

Peer reviewed
PDF on ERIC

Kim, Sooyeon; Walker, Michael E. – ETS Research Report Series, 2009

This study investigated the subgroup invariance of equating functions for a licensure test in the context of a nonequivalent groups with anchor test (NEAT) design. Examinees who had taken a new, to-be-equated form of the test were divided into three subgroups according to their previous testing experience: (a) repeaters who previously took the…

Descriptors: Equated Scores, Licensing Examinations (Professions), Test Construction, Repetition

Investigating the Effectiveness of a Synthetic Linking Function on Small Sample Equating. Research Report. ETS RR-07-37

Peer reviewed
PDF on ERIC

Kim, Sooyeon; von Davier, Alina A.; Haberman, Shelby – ETS Research Report Series, 2007

The synthetic function, which is a weighted average of the identity (the trivial linking function for forms that are known to be completely parallel) and a traditional equating method, has been proposed as an alternative for performing linking with very small samples (Kim, von Davier, & Haberman, 2006). The purpose of the present study was to…

Descriptors: Equated Scores, Sample Size, Statistical Analysis, Licensing Examinations (Professions)
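
The abstract above defines the synthetic function as a weighted average of the identity and a traditional equating method. A minimal sketch of that definition follows, under stated assumptions: the names `synthetic_link` and `linear_equate` are hypothetical, the weight `w` is assumed to apply to the traditional equating function (with `1 - w` on the identity), and mean-sigma linear equating stands in for "a traditional equating method". It is not the report's implementation.

```python
from typing import Callable


def linear_equate(x: float, mean_x: float, sd_x: float,
                  mean_y: float, sd_y: float) -> float:
    """Mean-sigma linear equating (illustrative stand-in for a
    traditional equating method): map a form-X score to the form-Y scale."""
    return mean_y + (sd_y / sd_x) * (x - mean_x)


def synthetic_link(x: float, equate: Callable[[float], float], w: float) -> float:
    """Weighted average of a traditional equating function and the identity.

    Assumption: w weights the equating function and (1 - w) the identity,
    so w = 0 returns the raw score unchanged and w = 1 returns equate(x).
    """
    if not 0.0 <= w <= 1.0:
        raise ValueError("w must lie in [0, 1]")
    return w * equate(x) + (1.0 - w) * x


# Made-up form statistics, for illustration only.
def equate(score: float) -> float:
    return linear_equate(score, mean_x=50.0, sd_x=10.0, mean_y=52.0, sd_y=10.0)


half_weight = synthetic_link(40.0, equate, 0.5)
```

With these made-up statistics, a raw score of 40 equates to 42 under the linear function, so an equal weighting places the synthetic result halfway between the raw and equated scores.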

Investigating the Effectiveness of Collateral Information on Small-Sample Equating. Research Report. ETS RR-08-52

Peer reviewed
PDF on ERIC

Kim, Sooyeon; Livingston, Samuel A.; Lewis, Charles – ETS Research Report Series, 2008

This paper describes an empirical evaluation of a Bayesian procedure for equating scores on test forms taken by small numbers of examinees, using collateral information from the equating of other test forms. In this procedure, a separate Bayesian estimate is derived for the equated score at each raw-score level, making it unnecessary to specify a…

Descriptors: Equated Scores, Statistical Analysis, Sample Size, Bayesian Statistics

The Impact of Anchor Test Length on Equating Results in a Nonequivalent Groups Design. Research Report. ETS RR-07-44

Peer reviewed
PDF on ERIC

Ricker, Kathryn L.; von Davier, Alina A. – ETS Research Report Series, 2007

This study explored the effects of external anchor test length on final equating results of several equating methods, including equipercentile (frequency estimation), chained equipercentile, kernel equating (KE) poststratification equating (PSE) with optimal bandwidths, and KE PSE linear (large bandwidths) when using the nonequivalent groups anchor test…

Descriptors: Equated Scores, Test Items, Statistical Analysis, Test Length

Small-Sample DIF Estimation Using Log-Linear Smoothing: A SIBTEST Application. Research Report. ETS RR-07-10

Peer reviewed
PDF on ERIC

Puhan, Gautam; Moses, Tim P.; Yu, Lei; Dorans, Neil J. – ETS Research Report Series, 2007

The purpose of the current study was to examine whether log-linear smoothing of observed score distributions in small samples results in more accurate differential item functioning (DIF) estimates under the simultaneous item bias test (SIBTEST) framework. Data from a teacher certification test were analyzed using White candidates in the reference…

Descriptors: Test Bias, Computation, Sample Size, Accuracy

An Elementary Test of the Normal 2PL Model against the Normal 3PL Alternative. Research Report. ETS RR-06-14

Peer reviewed
PDF on ERIC

Haberman, Shelby J. – ETS Research Report Series, 2006

A simple score test of the normal two-parameter logistic (2PL) model is presented that examines the potential attraction of the normal three-parameter logistic (3PL) model for use with a particular item. Application is made to data from a test from the Praxis™ series. Results from this example raise the question whether the normal 3PL model should…

Descriptors: Statistical Analysis, Models, Licensing Examinations (Professions), Teacher Certification

Author

von Davier, Alina A.	6
Kim, Sooyeon	4
Dorans, Neil J.	3
Haberman, Shelby J.	3
Haberman, Shelby	2
Puhan, Gautam	2
Sinharay, Sandip	2
Tannenbaum, Richard J.	2
Blew, Edwin O.	1
Buzick, Heather	1
Cao, Yi	1
Castellano, Karen	1
Chris Kelbaugh	1
Deng, Weiling	1
Gitomer, Drew	1
Grant, Mary C.	1
Han, Ning	1
Hebert, Delano	1
Holland, Paul W.	1
Jing Miao	1
Kane, Michael T.	1
Kannan, Priya	1
Knorr, Colleen M.	1
Larkin, Kevin C.	1
Leusner, Dawn	1