Lets say this is my data-frame
df = pd.DataFrame({ 'bio' : ['1', '1', '1', '4'],
'center' : ['one', 'one', 'two', 'three'],
'outcome' : ['f','t','f','f'] })
It looks like this ...
bio center outcome
0 1 one f
1 1 one t
2 1 two f
3 4 three f
I want to drop row 1 because it has the same bio & center as row 0.
I want to keep row 2 because it has the same bio but different center then row 0.
Something like this won't work based on drop_duplicates input structure but it's what I am trying to do
df.drop_duplicates(subset = 'bio' & subset = 'center' )
Any suggestions ?
edit : changed df a bit to fit example by correct answer
See Question&Answers more detail:
os 与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…