sed - Renumbering duplicate lines with counter awk

Question

Welcome To Ask or Share your Answers For Others

sed - Renumbering duplicate lines with counter awk

posted Oct 7, 2021 in Technique[技术] by 深蓝 (71.8m points)

sed - Renumbering duplicate lines with counter awk

I have duplicate words in csv. And i need to count it in such way:

jsmith
jsmith
kgonzales
shouston
dgenesy
kgonzales
jsmith

to this:

[email protected]
[email protected]
[email protected]
[email protected]
[email protected]
[email protected]
[email protected]

I have smth like that, but it doesn't work properly for me..or i cant do it enter link description here

question from:https://stackoverflow.com/questions/65836903/renumbering-duplicate-lines-with-counter-awk

与恶龙缠斗过久,自身亦成为恶龙；凝视深渊过久,深渊将回以凝视…

1 Reply

深蓝 · Answer 1 · 2021-10-06T19:35:15+0000

A simple way to do it is maintain an array using the username as the index and increment it each time you read a user, e.g.

awk '{ print (($1 in a) ? $1 a[$1] : $1) "@email.com"; a[$1]++ }' file

The ternary (($1 in a) ? $1 a[$1] : $1) just checks if the user in in a[] yet, and if so uses the name plus the value of the array $1 a[$1] if the user is not in the array, then it just uses the user $1. The result of the ternary is concatenated with "@email.com" to complete the output.

Lastly, the value for the array element for the user is incremented, a[$1]++.

Example Use/Output

With your names in a file called users you would have:

$ awk '{ print (($1 in a) ? $1 a[$1] : $1) "@email.com"; a[$1]++ }' users
[email protected]
[email protected]
[email protected]
[email protected]
[email protected]
[email protected]
[email protected]

To Keep E-mail In Input File

If your input already contains an e-mail at the end of the username, then you simply want to output that record and skip to the next record, e.g.

awk '$1~/@/{print; next} { print (($1 in a) ? $1 a[$1] : $1) "@email.com"; a[$1]++ }' users

That will preserve [email protected] from your comment.

Example Input

jsmith
jsmith
kgonzales
shouston
[email protected]
dgenesy
kgonzales
jsmith

Example Output

[email protected]
[email protected]
[email protected]
[email protected]
[email protected]
[email protected]
[email protected]
[email protected]

Categories

sed - Renumbering duplicate lines with counter awk

sed - Renumbering duplicate lines with counter awk

Please log in or register to add a comment.

Please log in or register to reply this article.

1 Reply

Please log in or register to add a comment.

Just Browsing Browsing

Most popular tags