'R Commander' 태그의 글 목록 (8 Page)

R Commander

1.2. Normal probabilities... 2022.03.12
1.1. Normal quantiles... 2022.03.12
1. Single-sample proportion test... 2022.03.12
7. Effect plots... 2022.03.09
Cowles 데이터셋 2022.03.09
Friendly 데이터셋 2022.03.09
4. Kruskal-Wallis test... 2022.03.09
4. Multinomial logit model... 2022.03.09
3. Generalized linear model... 2022.03.09
birthwt 데이터셋 2022.03.09

1.2. Normal probabilities...

modernity4Rcmdr 2022. 3. 12. 10:14

2022. 3. 12. 10:14

분포도 > 연속 분포 > 정규 분포 > 정규 확률...
Distributions > Continuous distributions > Normal distribution > Normal probabilities...

사례 값을 넣고, 분포도의 (꼬리) 방향을 정해주면 확률이 계산된다.

pnorm(c(1.644854), mean=0, sd=1, lower.tail=TRUE)
pnorm(c(1.644854), mean=0, sd=1, lower.tail=FALSE)
pnorm(c(-1.644854), mean=0, sd=1, lower.tail=TRUE)
pnorm(c(-1.644854), mean=0, sd=1, lower.tail=FALSE)

'Distributions > Continuous distributions' 카테고리의 다른 글

1.4. Sample from normal distribution... (0)	2022.03.12
1.3. Plot normal distribution... (0)	2022.03.12
1.1. Normal quantiles... (0)	2022.03.12

1.1. Normal quantiles...

modernity4Rcmdr 2022. 3. 12. 09:57

2022. 3. 12. 09:57

분포도 > 연속 분포 > 정규 분포 > 정규 분위수...

Distributions > Continuous distributions > Normal distribution > Normal quantiles...

확률을 넣고, 분포도의 (꼬리) 방향을 정해주면, 분위수가 계산된다. <확률>을 95%(.095)로 선택해보자. <낮은쪽 꼬리/높은쪽 꼬리> 선택에 따라 어떻게 값이 변하는지 살펴보자.

qnorm(c(.95), mean=0, sd=1, lower.tail=TRUE)
qnorm(c(.95), mean=0, sd=1, lower.tail=FALSE)

아래 화면에서 95% 확률로 <낮은쪽 꼬리/높은쪽 꼬리> 방향의 값을 확인할 수 있다.

'Distributions > Continuous distributions' 카테고리의 다른 글

1.4. Sample from normal distribution... (0)	2022.03.12
1.3. Plot normal distribution... (0)	2022.03.12
1.2. Normal probabilities... (0)	2022.03.12

1. Single-sample proportion test...

modernity4Rcmdr 2022. 3. 12. 09:33

2022. 3. 12. 09:33

통계 > 비율 > 일-표본 비율 검정...

Statistics > Proportions > Single-sample proportion test...

요인형 변수를 두개 이상 가지고 있는 데이터셋이 활성화되어 있다면, '통계 > 비율 > 이-표본 비율 검정..' 메뉴 기능을 이용할 수 있다. carData 패키지에 있는 Chile 데이터셋을 활용해서 연습해보자. 먼저, '데이터 > 패키지에 있는 데이터 > 첨부된 패키지에서 데이터셋 읽기...' 메뉴 기능을 통하여 Chile 데이터셋을 활성화시키자. R Commander 상단에 'Chile'라는 데이터셋이 활성화되었는지 확인하자.

https://rcmdr.tistory.com/239

Chile 데이터셋

carData::Chile() data(Chile, package="carData") '데이터 > 패키지에 있는 데이터 > 첨부된 패키지에서 데이터셋 읽기...' 메뉴 기능을 선택하면 하위 선택 창으로 이동한다. 아래와 같이 carData 패키지를 선택.

rcmdr.kr

요인형 변수 vote를 변형시켜 vote.f 변수를 새롭게 코딩하고 사용하도록 하자.

data(Chile, package="carData")
Chile <- within(Chile, {
  vote.f <- Recode(vote, '"Y" = "yes"; "N" = "no"; else = NA', as.factor=TRUE)
})

<선택기능> 창에 표시되어 있는 기본 설정을 그대로 사용하자.

local({
  .Table <- xtabs(~ vote.f , data= Chile )
  cat("\nFrequency counts (test is for first level):\n")
  print(.Table)
  prop.test(rbind(.Table), alternative='two.sided', p=.5, conf.level=.95, correct=FALSE)
})

출력창에 나오는 결과는 아래와 같다:

?prop.test  # stats 패키지의 prop.test 도움말 보기

heads <- rbinom(1, size = 100, prob = .5)
prop.test(heads, 100)          # continuity correction TRUE by default
prop.test(heads, 100, correct = FALSE)

## Data from Fleiss (1981), p. 139.
## H0: The null hypothesis is that the four populations from which
##     the patients were drawn have the same true proportion of smokers.
## A:  The alternative is that this proportion is different in at
##     least one of the populations.

smokers  <- c( 83, 90, 129, 70 )
patients <- c( 86, 93, 136, 82 )
prop.test(smokers, patients)

https://rcmdr.tistory.com/53

1. Recode variables...

데이터 > 활성 데이터셋의 변수 관리하기 > 변수를 다시 코딩하기... Data > Manage variables in active data set > Recode variables... 기존 변수를 이용하여 새로운 변수를 만들 수 있다. R Commander에서 이..

rcmdr.kr

'Statistics > Proportions' 카테고리의 다른 글

2. Two-sample proportions test... (0)	2022.06.30

7. Effect plots...

modernity4Rcmdr 2022. 3. 9. 19:03

2022. 3. 9. 19:03

모델 > 그래프 > 효과 그림...

Models > Graphs > Effect plots...

'모델 > 그래프 > 효과 그림...' 기능은 미리 모델이 만들어져야 이용할 수 있다. 만들어진 모델은 아래와 같이 R Commander 상단에서 확인할 수 있다. carData 패키지의 Cowles 데이터셋으로 만든 GLM.1 모델을 활용하는 것이다.

<모델 효과 그림(들)> 창 중간에 있는 <예측변수 (하나 이상 선택)> 기능에서 sex, neuroticism, extraversion 세 변수를 모두 선택해보자.

plot(allEffects(GLM.1))

carData 패키지의 Prestige 데이터셋을 이용하여 연습해보자. 아래와 같이 prestige (직업의 사회적 권위)에 대한 education (교육연수), income (연수입), women (여성 참여율)의 영향력을 type (직업유형)별로 살펴보는 모델을 만들었다고 가정하자.

data(Prestige, package="carData")
LinearModel.1 <- lm(prestige ~ education + income + women + type, data=Prestige)
summary(LinearModel.1)

아래와 같이 LinearModel.1의 요약 정보가 출력될 것이다.

이러한 LinearModel.1의 효과 그림을 시각화 할 수 있다. <모델 효과 그림(들)> 창의 <예측변수(하나 이상 선택)> 기능에서 네개의 변수를 모두 선택해보자. 그리고 예(OK) 버튼을 누른다.

plot(allEffects(LinearModel.1))

아래와 같이 그래픽 장치 창에 선택된 변수 네개의 효과 그림이 등장할 것이다.

한편, <잔차 일부분 그리기> 기능을 선택해보자.

그래픽 장치 창에 잔차들이 플롯으로 표시된다. 표시된 잔차의 분포를 보면서 추가로로 통찰력을 키울 수 있다.

'Models > Graphs' 카테고리의 다른 글

6. Influence index plot... (0)	2022.06.21
2. Residual quantile-comparison plot... (0)	2022.06.21
5. Influence plot... (0)	2022.06.21
4. Added-variable plots... (0)	2022.06.20
3. Component + residual plots... (0)	2022.06.20

Cowles 데이터셋

modernity4Rcmdr 2022. 3. 9. 18:20

2022. 3. 9. 18:20

carData::Cowles

data(Cowles, package="carData")

help("Cowles")

Cowles {carData}

R Documentation

Cowles and Davis's Data on Volunteering

Description

The Cowles data frame has 1421 rows and 4 columns. These data come from a study of the personality determinants of volunteering for psychological research.

Usage

Cowles

Format

This data frame contains the following columns:

neuroticism

scale from Eysenck personality inventory

extraversion

scale from Eysenck personality inventory

sex

a factor with levels: female; male

volunteer

volunteeing, a factor with levels: no; yes

Source

Cowles, M. and C. Davis (1987) The subject matter of psychology: Volunteers. British Journal of Social Psychology 26, 97–102.

[Package carData version 3.0-5 Index]

Friendly 데이터셋

modernity4Rcmdr 2022. 3. 9. 18:08

2022. 3. 9. 18:08

carData::Friendly()

data(Friendly, package="carData")

help("Friendly")

Friendly {carData}

R Documentation

Format Effects on Recall

Description

The Friendly data frame has 30 rows and 2 columns. The data are from an experiment on subjects' ability to remember words based on the presentation format.

Usage

Friendly

Format

This data frame contains the following columns:

condition

A factor with levels: Before, Recalled words presented before others; Meshed, Recalled words meshed with others; SFR, Standard free recall.

correct

Number of words correctly recalled, out of 40 on final trial of the experiment.

Source

Friendly, M. and Franklin, P. (1980) Interactive presentation in multitrial free recall. Memory and Cognition 8 265–270 [Personal communication from M. Friendly].

References

Fox, J. (2016) Applied Regression Analysis and Generalized Linear Models, Third Edition. Sage.

Fox, J. and Weisberg, S. (2019) An R Companion to Applied Regression, Third Edition, Sage.

[Package carData version 3.0-5 Index]

4. Kruskal-Wallis test...

modernity4Rcmdr 2022. 3. 9. 17:49

2022. 3. 9. 17:49

통계 > 비모수 검정 > Kruskal-Wallis 검정...
Statistics > Nonparametric tests > Kruskal-Wallis test...

carData 패키지에 있는 Friendly 데이터셋을 활용한다. https://rcmdr.tistory.com/153

Friendly 데이터셋

carData > Friendly data(Friendly, package="carData") help("Friendly")

rcmdr.kr

data(Friendly, package="carData")
Tapply(correct ~ condition, median, na.action=na.omit, data=Friendly) # medians by group
kruskal.test(correct ~ condition, data=Friendly)

'Statistics > Nonparametric tests' 카테고리의 다른 글

3. Paired-samples Wilcoxon test... (0)	2022.03.21
2. Single-sample Wilcoxon test... (0)	2022.03.21
1. Two-sample Wilcoxon test... (0)	2022.03.21
5. Friedman rank-sum test... (0)	2022.03.20

4. Multinomial logit model...

modernity4Rcmdr 2022. 3. 9. 15:14

2022. 3. 9. 15:14

통계 > 적합성 모델 > 다항 로짓 모델...
Statistics > Fit models > Multinomial logit model...

MASS 패키지에 있는 birthwt 데이터셋을 활용해보자. 먼저 birthwt 데이터셋을 활성화 시킨다. https://rcmdr.kr/149

birthwt 데이터셋

MASS > birthwt data(birthwt, package="MASS") birthwt 데이터셋이 활성화된 후, <데이터셋 보기> 버튼을 누르면 아래와 같이 내부 구성을 볼 수 있다: help("birthwt")

rcmdr.kr

이후 일부 변수의 특징을 변화시켜 bwt라는 새로운 데이터셋을 만들자.아래의 <일반화 선형 모델> 연습 중간에 bwt를 만드는 스크립트가 있다. https://rcmdr.tistory.com/150

3. Generalized linear model...

통계 > 적합성 모델 > 일반화 선형 모델... Statistics > Fit models > Generalized linear model... MASS 패키지에 있는 birthwt 데이터셋을 활용하자. https://rcmdr.tistory.com/149 birthwt data(birthwt, pa..

rcmdr.kr

아래 화면은 bwt 데이터셋을 활용한 사례이다. low 변수를 반응변수로, 다른 나머지 변수를 설명변수군에 모두 포함시키자.

MLM.1 <- multinom(low ~ ., data=bwt, trace=FALSE)
summary(MLM.1, cor=FALSE, Wald=TRUE)

MLM.1 이라는 모형이 만들어졌다면, '모델 > 가설 검정 > 분산분석표...' 메뉴 기능을 이용할 수 있다.

'Statistics > Fit models' 카테고리의 다른 글

5. Ordinal regression model... (0)	2022.06.24
6. Linear mixed model... (0)	2022.06.23
3. Generalized linear model... (0)	2022.03.09
2. Linear model... (0)	2022.03.07
1. Linear regression... (0)	2022.03.07

3. Generalized linear model...

modernity4Rcmdr 2022. 3. 9. 14:42

2022. 3. 9. 14:42

통계 > 적합성 모델 > 일반화 선형 모델...
Statistics > Fit models > Generalized linear model...

MASS 패키지에 있는 birthwt 데이터셋을 활용하자. https://rcmdr.tistory.com/149

birthwt 데이터셋

MASS > birthwt data(birthwt, package="MASS") birthwt 데이터셋이 활성화된 후, <데이터셋 보기> 버튼을 누르면 아래와 같이 내부 구성을 볼 수 있다: help("birthwt")

rcmdr.kr

birthwt 데이터셋을 <일반화 선형 모델> 분석을 위하여 변형시켜서 bwt라는 데이터셋으로 만들자.

bwt <- with(birthwt, {
race <- factor(race, labels = c("white", "black", "other"))
ptd <- factor(ptl > 0)
ftv <- factor(ftv)
levels(ftv)[-(1:2)] <- "2+"
data.frame(low = factor(low), age, lwt, race, smoke = (smoke > 0),
           ptd, ht = (ht > 0), ui = (ui > 0), ftv)
})

options(contrasts = c("contr.treatment", "contr.poly"))

bwt 데이터셋을 활성화시키자.

GLM.3 <- glm(low ~ ., family=binomial(logit), data=bwt)
summary(GLM.3)

exp(coef(GLM.3))  # Exponentiated coefficients ("odds ratios")

## an example with offsets from Venables & Ripley (2002, p.189)
utils::data(anorexia, package = "MASS")

GLM.2 <- glm(Postwt ~ Prewt + Treat + offset(Prewt), 
	family=gaussian(identity), data=anorexia)
summary(GLM.2)

data(Cowles, package="carData")

GLM.1 <- glm(volunteer ~ sex + neuroticism*extraversion, family=binomial(logit), data=Cowles)
summary(GLM.1)
exp(coef(GLM.1))  # Exponentiated coefficients ("odds ratios")

'Statistics > Fit models' 카테고리의 다른 글

5. Ordinal regression model... (0)	2022.06.24
6. Linear mixed model... (0)	2022.06.23
4. Multinomial logit model... (0)	2022.03.09
2. Linear model... (0)	2022.03.07
1. Linear regression... (0)	2022.03.07

birthwt 데이터셋

modernity4Rcmdr 2022. 3. 9. 13:48

2022. 3. 9. 13:48

MASS::birthwt

data(birthwt, package="MASS")

birthwt 데이터셋이 활성화된 후, <데이터셋 보기> 버튼을 누르면 아래와 같이 내부 구성을 볼 수 있다:

help("birthwt")

birthwt {MASS}

R Documentation

Risk Factors Associated with Low Infant Birth Weight

Description

The birthwt data frame has 189 rows and 10 columns. The data were collected at Baystate Medical Center, Springfield, Mass during 1986.

Usage

birthwt

Format

This data frame contains the following columns:

low

indicator of birth weight less than 2.5 kg.

age

mother's age in years.

lwt

mother's weight in pounds at last menstrual period.

race

mother's race (1 = white, 2 = black, 3 = other).

smoke

smoking status during pregnancy.

ptl

number of previous premature labours.

history of hypertension.

presence of uterine irritability.

ftv

number of physician visits during the first trimester.

bwt

birth weight in grams.

Source

Hosmer, D.W. and Lemeshow, S. (1989) Applied Logistic Regression. New York: Wiley

References

Venables, W. N. and Ripley, B. D. (2002) Modern Applied Statistics with S. Fourth edition. Springer.

Examples

bwt <- with(birthwt, {
race <- factor(race, labels = c("white", "black", "other"))
ptd <- factor(ptl > 0)
ftv <- factor(ftv)
levels(ftv)[-(1:2)] <- "2+"
data.frame(low = factor(low), age, lwt, race, smoke = (smoke > 0),
           ptd, ht = (ht > 0), ui = (ui > 0), ftv)
})
options(contrasts = c("contr.treatment", "contr.poly"))
glm(low ~ ., binomial, bwt)

[Package MASS version 7.3-55 Index]

bwt.RData

0.00MB

'Dataset_info > birthwt' 카테고리의 다른 글

bwt 데이터셋 (0)	2022.06.24

PREV 이전 1 ···5 6 7 8 9 10 11 ···21 NEXT 다음

Rcmdr.kr: An R Commander User in Korea

R Commander

1.2. Normal probabilities...

'Distributions > Continuous distributions' 카테고리의 다른 글

1.1. Normal quantiles...

'Distributions > Continuous distributions' 카테고리의 다른 글

1. Single-sample proportion test...

'Statistics > Proportions' 카테고리의 다른 글

7. Effect plots...

'Models > Graphs' 카테고리의 다른 글

Cowles 데이터셋

Cowles and Davis's Data on Volunteering

Description

Usage

Format

Source

Friendly 데이터셋

Format Effects on Recall

Description

Usage

Format

Source

References

4. Kruskal-Wallis test...

'Statistics > Nonparametric tests' 카테고리의 다른 글

4. Multinomial logit model...

'Statistics > Fit models' 카테고리의 다른 글

3. Generalized linear model...

'Statistics > Fit models' 카테고리의 다른 글

birthwt 데이터셋

Risk Factors Associated with Low Infant Birth Weight

Description

Usage

Format

Source

References

Examples

'Dataset_info > birthwt' 카테고리의 다른 글

+ Recent posts

티스토리툴바