ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	7

Descriptor

Statistical Analysis	50
Testing Problems	50
Scores	28
Test Reliability	18
Equated Scores	14
Test Interpretation	12
Test Validity	12
Correlation	9
Test Theory	9
Achievement Tests	8
Mathematical Models	8
Measurement Techniques	8
Test Construction	8
Elementary Secondary Education	7
Test Bias	7
Comparative Analysis	6
Cutting Scores	6
Item Analysis	6
Scoring	6
Test Items	6
Testing	6
Computer Assisted Testing	5
Error of Measurement	5
Latent Trait Theory	5
Standardized Tests	5
More ▼

Source

Journal of Educational…	4
Educational and Psychological…	3
Applied Psychological…	2
ETS Research Report Series	2
Assessment in Education:…	1
English Language Teaching	1
Journal of Economic Education	1
NCME	1
ProQuest LLC	1
Psychometrika	1
Research Matters	1
More ▼

Publication Type

Reports - Research	35
Journal Articles	14
Speeches/Meeting Papers	9
Reports - Evaluative	4
Tests/Questionnaires	3
Collected Works - Serials	1
Dissertations/Theses -…	1
Guides - Classroom - Teacher	1
Reports - Descriptive	1

Education Level

Secondary Education	2
Elementary Education	1
Higher Education	1
Postsecondary Education	1

Audience

Researchers	6
Practitioners	1
Teachers	1

Location

Asia	1
China	1
Netherlands	1
Taiwan	1

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

General Aptitude Test Battery	2
Indiana Statewide Testing for…	2
Metropolitan Achievement Tests	2
SAT (College Admission Test)	2
Test of English as a Foreign…	2
Armed Services Vocational…	1
California Achievement Tests	1
Graduate Record Examinations	1
Pennsylvania Educational…	1
Teacher Efficacy Scale	1
Test of English for…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 50 results Save | Export

Which Assessment Is Harder? Some Limits of Statistical Linking

Download full text

Benton, Tom; Williamson, Joanna – Research Matters, 2022

Equating methods are designed to adjust between alternate versions of assessments targeting the same content at the same level, with the aim that scores from the different versions can be used interchangeably. The statistical processes used in equating have, however, been extended to statistically "link" assessments that differ, such as…

Descriptors: Statistical Analysis, Equated Scores, Definitions, Alternative Assessment

Assessing Individual-Level Impact of Interruptions during Online Testing

Peer reviewed

Direct link

Sinharay, Sandip; Wan, Ping; Choi, Seung W.; Kim, Dong-In – Journal of Educational Measurement, 2015

With an increase in the number of online tests, the number of interruptions during testing due to unexpected technical issues seems to be on the rise. For example, interruptions occurred during several recent state tests. When interruptions occur, it is important to determine the extent of their impact on the examinees' scores. Researchers such as…

Descriptors: Computer Assisted Testing, Testing Problems, Scores, Statistical Analysis

Determining the Overall Impact of Interruptions during Online Testing

Peer reviewed

Direct link

Sinharay, Sandip; Wan, Ping; Whitaker, Mike; Kim, Dong-In; Zhang, Litong; Choi, Seung W. – Journal of Educational Measurement, 2014

With an increase in the number of online tests, interruptions during testing due to unexpected technical issues seem unavoidable. For example, interruptions occurred during several recent state tests. When interruptions occur, it is important to determine the extent of their impact on the examinees' scores. There is a lack of research on this…

Descriptors: Computer Assisted Testing, Testing Problems, Scores, Regression (Statistics)

Preparing for the Speaking Tasks of the "TOEFL iBT"® Test: An Investigation of the Journeys of Chinese Test Takers. "TOEFL iBT"® Research Report. TOEFL iBT-28. ETS Research Report. RR-17-19

Peer reviewed
PDF on ERIC

Download full text

Yu, Guoxing; He, Lianzhen; Rea-Dickins, Pauline; Kiely, Richard; Lu, Yanbin; Zhang, Jing; Zhang, Yan; Xu, Shasha; Fang, Lin – ETS Research Report Series, 2017

Language test preparation has often been studied within the consequential validity framework in relation to ethics, equity, fairness, and washback of assessment. The use of independent and integrated speaking tasks in the "TOEFL iBT"® test represents a significant development and innovation in assessing speaking ability in academic…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Oral Language

Educational Measurement Issues and Implications of High Stakes Decision Making in Final Examinations in Secondary Education in the Netherlands

Peer reviewed

Direct link

van Rijn, P. W.; Beguin, A. A.; Verstralen, H. H. F. M. – Assessment in Education: Principles, Policy & Practice, 2012

While measurement precision is relatively easy to establish for single tests and assessments, it is much more difficult to determine for decision making with multiple tests on different subjects. This latter is the situation in the system of final examinations for secondary education in the Netherlands and is used as an example in this paper. This…

Descriptors: Secondary Education, Tests, Foreign Countries, Decision Making

Pedagogical Implications of Score Distribution Pattern and Learner Satisfaction in an Intensive TOEIC Course

Peer reviewed
PDF on ERIC

Download full text

Kang, Che Chang – English Language Teaching, 2014

The study aimed at investigating TOEIC score distribution patterns and learner satisfaction in an intensive TOEIC course and drew implications for pedagogical practice. A one-group pre-test post-test experiment and a survey on learner satisfaction were conducted on Taiwanese college EFL students (n = 50) in a case study. Results showed that the…

Descriptors: Teaching Methods, Second Language Learning, Second Language Instruction, English (Second Language)

Teachers' Motivation and Beliefs in a High-Stakes Testing Context

Direct link

Dawson, Heather S. – ProQuest LLC, 2012

High-stakes testing has created challenges for teachers, administrators, parents, students, and other related education stakeholders in recent decades (Nichols & Berliner, 2007). While high-stakes tests have a long history (Ravitch, 2009) it was not until No Child Left Behind was signed into law in 2002 that the tests became law for most…

Descriptors: Beliefs, High Stakes Tests, Teacher Motivation, Teacher Attitudes

The Stability Coefficient

Peer reviewed

Cureton, Edward E. – Educational and Psychological Measurement, 1971

A derivation of a formula for the stability coefficient is presented and discussed in terms of test reliability over time. (PR)

Descriptors: Error of Measurement, Raw Scores, Statistical Analysis, Test Reliability

Comparability of Scores from Different Tests though on the Same Scale

Peer reviewed

Boldt, R. F. – Educational and Psychological Measurement, 1974

Descriptors: Comparative Testing, Equated Scores, National Norms, Raw Scores

The Effects of Repeaters on Test Equating.

Download full text

Andrulis, Richard S.; And Others – 1974

The purpose of this investigation was to establish the effects of repeaters on test equating. Since consideration was not given to repeaters in test equating, such as in the derivation of equations by Angoff (1971), the hypothetical effect needed to be established. A case study was examined which showed results on a test as expected; overall mean…

Descriptors: Cutting Scores, Equated Scores, Recall (Psychology), Retention (Psychology)

Issues of Reliability and Directional Bias in Standardized Achievement Tests: The Case of Mat70. P-5689.

Download full text

Barker, Pierce; Pelavin, Sol H. – 1976

This study was mounted to assess the validity of standard score transformations of raw test scores and test bias on the 1970 edition of the Metropolitan Achievement Test Battery, in the context of a controversial federally funded compensatory education program, the Educational Voucher Demonstration (EVD). On an individual level the validity of the…

Descriptors: Achievement Gains, Achievement Tests, Educationally Disadvantaged, Elementary Education

Grade Equivalent Scores. If Not Grade Equivalent Scores--Then What? NCME Measurement in Education. A Series of Special Reports of the National Council on Measurement in Education.

Echternacht, Gary; Plas, Jeanne M. – NCME, 1977

While most school districts believe they understand grade equivalent scores, teachers, parents, and measurement specialists frequently misinterpret this apparently simple statistical expression. Echternacht's article describes the construction, application, and interpretation of grade equivalent scores from the test publisher's perspective.…

Descriptors: Achievement Rating, Achievement Tests, Elementary Education, Grade Equivalent Scores

A Comparison of the Nedelsky and Angoff Cutting Score Procedures Using Generalizability Theory.

Peer reviewed

Brennan, Robert L.; Lockwood, Robert E. – Applied Psychological Measurement, 1980

Generalizability theory is used to characterize and quantify expected variance in cutting scores and to compare the Nedelsky and Angoff procedures for establishing a cutting score. Results suggest that the restricted nature of the Nedelsky (inferred) probability scale may limit its applicability in certain contexts. (Author/BW)

Descriptors: Cutting Scores, Generalization, Statistical Analysis, Test Reliability

Discrimination Indices Commonly Used in Military Training Environments: Effects of Departures from Normal Distributions.

Download full text

Sarvela, Paul D. – 1986

Four discrimination indices were compared, using score distributions which were normal, bimodal, and negatively skewed. The score distributions were systematically varied to represent the common circumstances of a military training situation using criterion-referenced mastery tests. Three 20-item tests were administered to 110 simulated subjects.…

Descriptors: Comparative Analysis, Criterion Referenced Tests, Item Analysis, Mastery Tests

The Stability of the SAT-Verbal Score Scale.

Modu, Christopher C.; Stern, June – 1977

To assess the stability of the Scholastic Aptitude Test verbal score scale SAT--V, 1963 and 1973 forms of the SAT--V were administered in counterbalanced order to spaced samples of the same group. The 1973 scores were placed on the reporting scale used for the 1963 form. The experimentally derived scores on the 1963 scale were compared with their…

Descriptors: College Bound Students, College Entrance Examinations, Educational Problems, Educational Trends

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

Barker, Pierce	2
Bormuth, John R.	2
Choi, Seung W.	2
Kim, Dong-In	2
Legg, Sue M.	2
Pelavin, Sol H.	2
Sinharay, Sandip	2
Wan, Ping	2
Alderman, Donald L.	1
Algina, James	1
Andrulis, Richard S.	1
Beguin, A. A.	1
Benton, Tom	1
Boldt, R. F.	1
Brennan, Robert L.	1
Budescu, David	1
Cope, Ronald T.	1
Cramer, Stephen E.	1
Cureton, Edward E.	1
Dawson, Heather S.	1
Echternacht, Gary	1
Fang, Lin	1
Gohmann, Stephan F.	1
Gordon, Howard R. D.	1
More ▼