Welcome To Ask or Share your Answers For Others

dataframe - Sort column names in specific order

Welcome To Ask or Share your Answers For Others

1 Reply

replyed Jan 27, 2021 by 深蓝 (71.8m points)

You can sort the column names according to the number before and after the underscore:

df2 = df.select(
    'id',
    *sorted(
        df.columns[1:], key=lambda c: (int(c.split('_')[0]), int(c.split('_')[1]))
    )
)

To get the other desired output, just swap 0 with 1 in the code above.

与恶龙缠斗过久,自身亦成为恶龙；凝视深渊过久,深渊将回以凝视…

...