I am trying to format a bibliography in a dataframe. Basically, for a column named "Bibliography", I want to extract the titles, sometimes delimited by and sometimes delimited by "
Now when I use
df['Bibliography'].str.extract(r'(?<=")(.*?)(?=,")')
It correctly extracts the titles delimited by " (but will produce NaN for titles delimited by )
So I tried applying str.extract over a slice of the data frame using .loc
df.loc[df['Bibliography'].str.contains('"'),'Bibliography']=df.loc[df['Bibliography'].str.contains('"'),'Bibliography'].str.extract(r'(?<=")(.*?)(?=,")')
But this results in NaN. I can't figure out why I can't use extract over a slice of the data frame.
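
A minimal sketch of one thing worth checking, offered as an assumption rather than a confirmed diagnosis: by default str.extract uses expand=True and returns a DataFrame (with a column labelled 0) even for a single capture group, and assigning that DataFrame into a single column through .loc may fail to line up with 'Bibliography'. Passing expand=False returns a Series instead, which aligns on the row index. The sample rows below are hypothetical and only illustrate the shape of the data.

```python
import pandas as pd

# Hypothetical sample data: the first row has a double-quoted title,
# the second row does not.
df = pd.DataFrame({
    "Bibliography": [
        'Smith, J. "A Study of Things," Journal of Stuff, 2001.',
        "Doe, A. Another Title. Press, 1999.",
    ]
})

# Same pattern as in the question: text after a " and before ,"
pattern = r'(?<=")(.*?)(?=,")'

# Boolean mask selecting only the rows that contain a double quote.
has_quotes = df["Bibliography"].str.contains('"')

# expand=False makes str.extract return a Series for a single group,
# so the assignment below aligns on the row index.
extracted = df.loc[has_quotes, "Bibliography"].str.extract(pattern, expand=False)
df.loc[has_quotes, "Bibliography"] = extracted

print(df)
```

Rows without a double quote are left untouched, so they can be handled separately with a second pattern for the other delimiter.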