COUNT UNIQUE ELEMENTS IN TEXT FILE WITH AWK ~ Tech Blog

Counting the distinct (unique) elements of a text file is a common task. Below is an example of doing this in AWK, using sample_data_1.txt, a tab-separated file with a header row.

cat sample_data_1.txt \
| awk 'BEGIN{FS="\t"} NR>1{names[$2]=1} END{print length(names)}'

Here is what is happening above:

cat sample_data_1.txt – reads the file and pipes the data to AWK
BEGIN{FS="\t"} – sets the field separator of the file to a tab
NR>1 – runs the following code block only if the record number is greater than 1 (skipping the header)
names[$2]=1 – this script counts the distinct elements of column 2, so each value of that column is stored as a key in the names array. AWK arrays are associative arrays (holding keys and values), so a repeated value just overwrites the same key. Each value is set to 1 only as a placeholder.
END{print length(names)} – after the last record, prints the number of keys in the names array, which is the number of distinct values
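
To see the steps above on concrete data, here is a minimal sketch. The contents of sample_data_1.txt are not shown in this post, so the example builds its own small tab-separated file; the column names, the values, and the variable names tmpfile and unique_count are made up for illustration.

```shell
# Build a small tab-separated file with a header row (hypothetical data).
tmpfile=$(mktemp)
printf 'id\tname\n1\talice\n2\tbob\n3\talice\n4\tcarol\n' > "$tmpfile"

# Count the distinct values in column 2, skipping the header line.
unique_count=$(awk 'BEGIN{FS="\t"} NR>1{names[$2]=1} END{print length(names)}' "$tmpfile")
echo "$unique_count"   # prints 3 (alice, bob, carol)

# Clean up the temporary file.
rm -f "$tmpfile"
```

Here AWK reads the file directly instead of through cat; the result is the same. Note that length() on an array is accepted by gawk and current mawk, but some older AWK implementations may reject it.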

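Using an array in AWK is typically faster than the common alternative of sort and uniq, especially on large files, because AWK makes a single pass over the data and never has to sort it. The equivalent pipeline looks like this (tail -n +2 skips the header, as NR>1 does in the AWK version):

cat sample_data_1.txt | tail -n +2 | cut -f2 | sort | uniq | wc -l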
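
As a quick sanity check, the two approaches can be run side by side on the same input. This is only a sketch on made-up data (the file contents and the names data_file, awk_count and sort_count are assumptions, not part of the original example). Both commands skip the header row here, via NR>1 and tail -n +2, so the counts are comparable.

```shell
# Hypothetical sample data: a header row plus four tab-separated records.
data_file=$(mktemp)
printf 'id\tname\n1\talice\n2\tbob\n3\talice\n4\tcarol\n' > "$data_file"

# Approach 1: a single pass with an AWK associative array.
awk_count=$(awk 'BEGIN{FS="\t"} NR>1{names[$2]=1} END{print length(names)}' "$data_file")

# Approach 2: drop the header, extract column 2, sort, deduplicate, count lines.
# tr removes the padding that some wc implementations put around the number.
sort_count=$(tail -n +2 "$data_file" | cut -f2 | sort | uniq | wc -l | tr -d '[:space:]')

echo "awk: $awk_count, sort/uniq: $sort_count"   # prints awk: 3, sort/uniq: 3

# Clean up the temporary file.
rm -f "$data_file"
```

To compare speed on a real file, each command can be prefixed with the shell's time keyword; the gap depends on the file size and on how many distinct values there are.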

Wednesday, 31 July 2019

I V RAMANA
