ERIC - Search Results

Showing 751 to 765 of 1,166 results

Computerized Adaptive Testing for Reading Placement and Diagnostic Assessment.

Peer reviewed

Shermis, Mark D.; And Others – Journal of Developmental Education, 1996

Describes a study to pilot-test a new reading assessment instrument designed to function in a computerized adaptive testing (CAT) environment. Indicates that the measure showed fair internal consistency and correlated well with other tests. Discusses advantages and disadvantages of CAT systems and describes the HyperCAT testing program. (23…

Descriptors: Computer Assisted Testing, Diagnostic Tests, Higher Education, Pilot Projects

A Comparison of Three Simple Test Theory Models.

Peer reviewed

Ramsay, James O. – Psychometrika, 1989

An alternative to the Rasch model is introduced. It characterizes strength of response according to the ratio of ability and difficulty parameters rather than their difference. Joint estimation and marginal estimation models are applied to two test data sets. (SLD)

Descriptors: Ability, Bayesian Statistics, College Entrance Examinations, Comparative Analysis

A Critical Examination of the "Reliability" and "Abnormality" Approaches to the Evaluation of Subtest Score Differences.

Peer reviewed

Cahan, Sorel – Educational and Psychological Measurement, 1989

Statistical significance and "abnormality" have been used as criteria for the evaluation of intra-individual subtest score differences. Shortcomings of these criteria are identified, and improved estimates of the true score differences are suggested. The applicability of the abnormality criterion to these improved estimates is reviewed.…

Descriptors: Estimation (Mathematics), Evaluation Methods, Individual Differences, Mathematical Models

Argumentativeness and Verbal Aggressiveness: Testing for Conceptual and Measurement Equivalence across Cultures.

Peer reviewed

Suzuki, Shinobu; Rancer, Andrew S. – Communication Monographs, 1994

Finds that the two-factor solution of the Argumentativeness Scale and the Verbal Aggressiveness Scale was a reasonable overall fit to samples of both U.S. and Japanese college students; orthogonality of the two constructs (argumentativeness and verbal aggressiveness) held for both samples; and the two scales had satisfactory construct validity for…

Descriptors: Communication Research, Construct Validity, Cross Cultural Studies, Evaluation Methods

Rater Reliability: A Maximum Likelihood Confirmatory Factor-Analytic Approach.

Peer reviewed

O'Grady, Kevin E.; Medoff, Deborah R. – Multivariate Behavioral Research, 1991

A procedure for evaluating a variety of rater reliability models is presented. A multivariate linear model is used to describe and assess a set of ratings. Parameters are represented in terms of a factor analytic model, and maximum likelihood methods test the model parameters. Illustrative examples are presented. (SLD)

Descriptors: Comparative Analysis, Correlation, Equations (Mathematics), Estimation (Mathematics)

Reflections on Stephen Jay Gould's "The Mismeasure of Man" (1981): A Retrospective Review. Book Review.

Peer reviewed

Carroll, John B. – Intelligence, 1995

It is argued that the statements and accusations made by Stephen Jay Gould about the use of factor analysis are incorrect and unjustified and that tests properly designed for the purpose can adequately measure a "general" or "g" factor of intelligence, particularly in view of the developments in testing since "The…

Descriptors: Factor Analysis, Intelligence Tests, Measurement Techniques, Nature Nurture Controversy

Modern Language Testing at the Turn of the Century: Assuring That What We Count Counts.

Peer reviewed

Bachman, Lyle F. – Language Testing, 2000

Reviews developments in language testing research and practice over the last 20 years, and suggests future directions in the areas of professionalizing the field and validation research. Argues that concerns for ethical conduct must be grounded in valid test use, so that professionalization and validation research are inseparable. (Author/VWL)

Descriptors: Ethics, Language Research, Language Tests, Second Language Instruction

Relation between Science Teachers' Assessment Tools and Students' Cognitive Development

Ozsevgec, Tuncay; Cepni, Salih – Online Submission, 2006

In order to determine students' achievement, science teachers have to develop their own assessment tools. This study attempts to find out the relationship between the teachers' assessment tools and students' cognitive development according to the teachers' teaching experiences. Six open-ended survey questions were developed and delivered to 59…

Descriptors: Foreign Countries, Correlation, Science Teachers, Evaluation Methods

Congeneric and (Essentially) Tau-Equivalent Estimates of Score Reliability: What They Are and How to Use Them

Peer reviewed

Graham, James M. – Educational and Psychological Measurement, 2006

Coefficient alpha, the most commonly used estimate of internal consistency, is often considered a lower bound estimate of reliability, though the extent of its underestimation is not typically known. Many researchers are unaware that coefficient alpha is based on the essentially tau-equivalent measurement model. It is the violation of the…

Descriptors: Models, Test Theory, Reliability, Structural Equation Models

Improving Measurement in Health Education and Health Behavior Research Using Item Response Modeling: Comparison with the Classical Test Theory Approach

Peer reviewed

Wilson, Mark; Allen, Diane D.; Li, Jun Corser – Health Education Research, 2006

This paper compares the approach and resultant outcomes of item response models (IRMs) and classical test theory (CTT). First, it reviews basic ideas of CTT, and compares them to the ideas about using IRMs introduced in an earlier paper. It then applies a comparison scheme based on the AERA/APA/NCME "Standards for Educational and…

Descriptors: Health Education, Self Efficacy, Health Behavior, Measures (Individuals)

Cross-Validation and Rasch Analyses of the Australian Version of the Multidimensional School Anger Inventory--Revised

Peer reviewed

Boman, Peter; Curtis, David; Furlong, Michael J.; Smith, Douglas C. – Journal of Psychoeducational Assessment, 2006

The construct validity of the Australian version of the Multidimensional School Anger Inventory-Revised (MSAI-R) was examined using exploratory factor analysis (EFA), Rasch analysis, and confirmatory factor analysis (CFA) on a sample of 1,400 Australian students enrolled in Years 8 through 12. The EFA revealed a strong replication of the MSAI-R's…

Descriptors: Affective Measures, Psychological Patterns, Construct Validity, Reliability

The Use of Vygotsky's Theory of the Zone of Proximal Development in Quantitative Research: A Critical Review.

Hayward, Pamela A. – 1995

This review critiques the use of Lev Vygotsky's concept of the zone of proximal development (ZPD) in quantitative research that focuses on the role communication plays in learning. A study that makes claims in terms of the ZPD should include a pretest, a problem-solving activity, and a posttest. Without these minimal elements, researchers are not…

Descriptors: Communication Research, Communication (Thought Transfer), Learning Processes, Pretests Posttests

Using Differential Item Functioning Procedures To Improve Interpretation of and Performance on the Verbal Subtest of the SAT.

Lai, Morris K.; Saka, Thomas – 1993

Two studies investigated factors affecting the scores of Hawaii students taking the verbal subtest of the Scholastic Aptitude Test (SAT). For the past several years, the mean verbal scores of Hawaii students have consistently been among the lowest 10% of all states. The first study addressed the identification of items and types of items that have…

Descriptors: Comparative Analysis, High School Seniors, High Schools, Instructional Effectiveness

The Robustness of LOGIST and BILOG IRT Estimation Programs to Violations of Local Independence.

Ackerman, Terry A. – 1987

One of the important underlying assumptions of all item response theory (IRT) models is that of local independence. This assumption requires that the response to an item on a test not be influenced by the response to any other items. This assumption is often taken for granted, with little or no scrutiny of the response process required to answer…

Descriptors: Computer Software, Correlation, Estimation (Mathematics), Latent Trait Theory

Theoretical and Empirical Comparisons of Holistic and Analytic Scoring of Written and Spoken Discourse.

Goulden, Nancy Rost – 1989

Since speech communication evaluators are beginning to adapt the analytic and holistic instruments and methods used for rating written products to oral products and performance, this research review investigated: (1) what the labels "analytic" and "holistic" mean; (2) the theoretical bases of the two scoring approaches; and (3)…

Descriptors: Comparative Analysis, Higher Education, Holistic Evaluation, Rating Scales
