r - group by and filter data management using dplyr

Question

posted Oct 24, 2021 in Technique[技术] by 深蓝 (71.8m points)

Take a simple dataset

a <- c(1,2,3,4,5,6,7,8)
b <- c(1,2,2,1,2,2,2,2)
c <- c(1,1,1,2,2,2,3,3)
d <- data.frame(a,b,c)

now I want to filter my data, so that we group_by(c) and then remove all data where no b=1occurs.

Thus the results (e) should look like d but without the two bottom rows

I have tried using

e <- d %>%
  group_by(c) %>%
  filter(n(b)>1)

The output should contain the data in green below and remove the data in red

enter image description here

See Question&Answers more detail:os

与恶龙缠斗过久,自身亦成为恶龙；凝视深渊过久,深渊将回以凝视…

1 Reply

深蓝 · Answer 1 · 2021-10-23T17:57:48+0000

Try

d %>% 
  group_by(c) %>% 
  filter(any(b == 1))

Which gives:

#Source: local data frame [6 x 3]
#Groups: c
#
#  a b c
#1 1 1 1
#2 2 2 1
#3 3 2 1
#4 4 1 2
#5 5 2 2
#6 6 2 2