How can I replace a column with its hash value (like MD5) in awk or sed?
The original file is super huge, so I need this to be really efficient.
I copy pasted larsks's response, but I have added the close line, to avoid the problem indicated in this post: gawk / awk: piping date to getline *sometimes* won't work
awk '{ tmp="echo " $2 " | openssl md5 | cut -f2 -d" "" tmp | getline cksum close(tmp) $2=cksum print }' < sample
1.4m articles
1.4m replys
5 comments
57.0k users