Publication Date
| In 2026 | 0 |
| Since 2025 | 2 |
| Since 2022 (last 5 years) | 33 |
| Since 2017 (last 10 years) | 901 |
| Since 2007 (last 20 years) | 2732 |
Descriptor
| Statistical Analysis | 3988 |
| Hypothesis Testing | 2382 |
| Foreign Countries | 1378 |
| Correlation | 766 |
| Questionnaires | 756 |
| Comparative Analysis | 730 |
| Scores | 548 |
| Testing | 514 |
| College Students | 447 |
| Computer Assisted Testing | 439 |
| Student Attitudes | 425 |
| More ▼ | |
Source
Author
| Tindal, Gerald | 12 |
| Alonzo, Julie | 10 |
| Lord, Frederic M. | 10 |
| Sinharay, Sandip | 10 |
| Lai, Cheng-Fei | 9 |
| Teo, Timothy | 8 |
| Wilcox, Rand R. | 8 |
| Algina, James | 7 |
| Games, Paul A. | 7 |
| Kim, Sooyeon | 7 |
| Marascuilo, Leonard A. | 7 |
| More ▼ | |
Publication Type
Education Level
Audience
| Researchers | 65 |
| Practitioners | 21 |
| Teachers | 20 |
| Students | 6 |
| Administrators | 5 |
| Policymakers | 4 |
| Counselors | 1 |
| Media Staff | 1 |
| Parents | 1 |
Location
| Nigeria | 160 |
| Germany | 80 |
| Australia | 65 |
| Turkey | 64 |
| India | 62 |
| Canada | 59 |
| Iran | 51 |
| Netherlands | 51 |
| China | 47 |
| Taiwan | 47 |
| Texas | 45 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 1 |
Peer reviewedVasilius, Janet M.; DeStephen, Dan – Journal of the American Forensic Association, 1979
Tests two hypotheses concerning success in debate: (1) that success would be enhanced by a fast speaking rate, large amounts of evidence, and use of jargon; and (2) that a high correlation exists among these same variables. Neither hypothesis is supported and several explanations are offered. (JMF)
Descriptors: Competition, Debate, Evaluation, Hypothesis Testing
Peer reviewedBintig, Arnfried – Educational and Psychological Measurement, 1980
Twelve variance-analytical and nonparametrical coefficients of reliability for rating scales designed for rating persons were compared to each other theoretically and empirically. Preference for two coefficients was established. The intraclass correlation coefficient appeared to be useful for the estimation of reliability as well. (Author/RL)
Descriptors: Analysis of Variance, Comparative Analysis, Hypothesis Testing, Mathematical Models
Peer reviewedKlinger, Don A.; Rogers, W. Todd – Alberta Journal of Educational Research, 2003
The estimation accuracy of procedures based on classical test score theory and item response theory (generalized partial credit model) were compared for examinations consisting of multiple-choice and extended-response items. Analysis of British Columbia Scholarship Examination results found an error rate of about 10 percent for both methods, with…
Descriptors: Academic Achievement, Educational Testing, Foreign Countries, High Stakes Tests
Peer reviewedDometrius, Nelson C.; Sigelman, Lee – Economics of Education Review, 1988
A critique of a procedure developed by Becker and Williams (1986) for gauging the extend to which an organization's composition suggests discrimination. Their model and its associated statistical test suffer from two problems: (1) it is difficult to dismiss the null hypothesis of no discrimination; and (2) the model makes an overly simple…
Descriptors: Hypothesis Testing, Models, Organizations (Groups), Research Design
Peer reviewedBlake, Joanna; And Others – Journal of Child Language, 1993
The validity of mean length of utterance (MLU) and a measure of syntactic complexity were tested against the language assessment, remediation, and screening procedure on spontaneous speech samples from 87 children, concluding that MLU is a valid measure of clausal complexity up to 4:5 and that the measure of syntactic complexity is more valid at…
Descriptors: Child Language, Grammar, Measures (Individuals), Oral Language
On the Modeling of Scaled Measurement Sequences: Implications for Analyses of Cognitive Development.
Peer reviewedLittle, Todd D.; Widaman, Keith F. – Intelligence, 1990
The analysis or modeling of Piagetian and psychometric measures of mental ability is discussed. An application of structural modeling procedures allowing the testing of hypotheses in this domain is presented. Such a model is illustrated through a study of numerical functional relations tasks with 77 elementary school students. (SLD)
Descriptors: Cognitive Development, Elementary Education, Elementary School Students, Hypothesis Testing
Peer reviewedLunz, Mary E.; Stahl, John A. – Teaching and Learning in Medicine, 1993
A discussion of multifacet Rasch model analysis describes the Rasch model and its assumptions, then presents an extension of the model to include a facet for the influence of examiner severity. The model is illustrated with an application to an oral examination administered by a medical specialty board. (Author/MSE)
Descriptors: Higher Education, Licensing Examinations (Professions), Medical Education, Models
Peer reviewedHu, Xiangen; Batchelder, William H. – Psychometrika, 1994
The statistical analysis of processing tree models is advanced by showing how the parameters of estimation and hypothesis testing, based on the likelihood functions, can be accomplished by adapting the expectation-maximization (EM) algorithm. The adaptation makes it easy to program a personal computer to accomplish the stages of statistical…
Descriptors: Computer Software, Equations (Mathematics), Estimation (Mathematics), Hypothesis Testing
Wollack, James A. – Applied Measurement in Education, 2006
Many of the currently available statistical indexes to detect answer copying lack sufficient power at small [alpha] levels or when the amount of copying is relatively small. Furthermore, there is no one index that is uniformly best. Depending on the type or amount of copying, certain indexes are better than others. The purpose of this article was…
Descriptors: Statistical Analysis, Item Analysis, Test Length, Sample Size
Bilker, Warren B.; Brensinger, Colleen; Gur, Ruben C. – Multivariate Behavioral Research, 2004
Testing homogeneity of correlations with Fisher's Z is inappropriate when correlations are themselves correlated. Suppose measurements of brain activation and performance are taken before and during a verbal memory task. Of interest are changes in activity gradients in specific regions, R1, R2, R3, and performance, V. The "correlated correlations"…
Descriptors: Statistical Analysis, Interaction, Testing, Factor Analysis
Zimmerman, Donald W.; Zumbo, Bruno D. – Educational and Psychological Measurement, 2005
Educational and psychological testing textbooks typically warn of the inappropriateness of performing arithmetic operations and statistical analysis on percentiles instead of raw scores. This seems inconsistent with the well-established finding that transforming scores to ranks and using nonparametric methods often improves the validity and power…
Descriptors: Statistical Analysis, Psychological Testing, Raw Scores, Evaluation Methods
Richardson, Mary; Rogness, Neal; Gajewski, Byron – Journal of Statistics Education, 2005
This paper describes an interactive activity developed for illustrating hypothesis tests on the mean for paired or matched samples. The activity is extended to illustrate assessing normality, the Wilcoxon signed rank test, Kaplan-Meier survival functions, two-way analysis of variance, and the randomized block design. (Contains 6 tables and 13…
Descriptors: Introductory Courses, Statistics, Hypothesis Testing, Surveys
Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2003
A criterion-referenced computerized test is expressed as a statistical hypothesis problem. This admits that it can be studied by using the theory of optimal design. The power function of the statistical test is used as a criterion function when designing the test. A formal proof is provided showing that all items should have the same item…
Descriptors: Test Items, Computer Assisted Testing, Statistics, Validity
Witt, Elizabeth A.; And Others – 1990
Recent trends in achievement test scores among elementary and secondary school students in Iowa are examined. Form 7 of the Iowa Tests of Basic Skills (ITBS) was used to measure the achievement of students in grades 3 and 7; and Form X-7 of the Iowa Tests of Educational Development (ITED) was used for 11th graders. These grade levels were chosen…
Descriptors: Achievement Tests, Comparative Testing, Educational Trends, Elementary School Students
Guernsey, Lisa – Chronicle of Higher Education, 1999
New computer software for physics, mathematics, computer science, and statistics courses at North Carolina State University and in some high schools allows students to solve problems on the computer, recording every answer submitted to provide faculty with a record of student performance, and providing immediate feedback to students. Computerized…
Descriptors: Case Studies, Computer Assisted Instruction, Computer Assisted Testing, Computer Science

Direct link
