So I've got 2 data-frames I'd like to merge together.
I'm merging on 3 columns, 2 is an easy join.
joined_df = pd.merge(df1, df2, how='left', on=['name', 'city'])
I want this to be using a third column, but it's going to be a comparison, something like this:
joined_df = pd.merge(df1, df2, how='left',
on=['name', 'city', 'df1.year' >= 'df2.year_min'])
Not sure what the right syntax is here.
If it was SQL, it would be easy for me.
SELECT * FROM df1
JOIN df2 on (df1.name = df2.name and df1.year = df2.year and df1.year > df2.year_min)
Any assistance?
See Question&Answers more detail:
os 与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…