- Joined
- Oct 7, 2008
- Posts
- 337
- Reaction score
- 263
Just use a tool to find the top 20,000 UK websites or something and check those for any changes.
Just check what's in the various top domain files, there will be some incorrect data, but the domains are more mainstream.
Using some old files:
$ grep '^[^\.]*.uk$' alexa-top-1m.csv | head
2937,unblockpirate.uk
7706,eplsite.uk
11094,soccerlive.uk
14343,bl.uk
19619,parliament.uk
22375,kidsflix.uk
30800,mod.uk
32204,steamid.uk
34419,totalsportek.uk
42298,hsbc.uk
$ grep '^[^\.]*.uk,' majestic_million.csv | head
21921,608,friendsoftheearth.uk,uk,4144,5507,friendsoftheearth.uk,uk,22103,613,4135,5487
24331,678,easily.uk,uk,3870,5120,easily.uk,uk,24427,681,3869,5125
35358,1085,gov.uk,uk,3095,3961,gov.uk,uk,36581,1123,3043,3881
51561,1521,osws.uk,uk,2519,3270,osws.uk,uk,50556,1501,2547,3299
58746,1691,fashionunited.uk,uk,2355,2984,fashionunited.uk,uk,54008,1584,2462,3111
71303,1982,cancelme.uk,uk,2146,2743,cancelme.uk,uk,71089,1989,2150,2733
74125,2061,awanshost.uk,uk,2111,2576,awanshost.uk,uk,73793,2058,2116,2573
89270,2430,dbang.uk,uk,1932,2536,dbang.uk,uk,90175,2468,1923,2529
93954,2549,foreversun.uk,uk,1886,2208,foreversun.uk,uk,93773,2541,1887,2208
95022,2568,gglink.uk,uk,1876,2258,gglink.uk,uk,94975,2572,1876,2259
$ grep '^[^\.]*.uk"' top10milliondomains.csv | head
"127","gov.uk","8.13"
"935","nhs.uk","7.43"
"1175","parliament.uk","7.33"
"1231","bl.uk","7.31"
"9948","mod.uk","6.81"
"12623","royal.uk","6.74"
"17632","transitcenter.uk","6.67"
"20293","mssoc.uk","6.67"
"20294","warminstertownfc.uk","6.67"
"36443","ticketweb.uk","6.53"
$ cat umbrella-top-1m.csv | grep '^[^\.]*\.uk.$' | head
52218,jpimedia.uk
69114,royal.uk
92053,zpbt.uk
102172,netweaver.uk
105952,bl.uk
107826,mod.uk
108874,yummly.uk
111204,abaresearch.uk
115530,parliament.uk
116583,nic.uk
Yes, some of these never switched to .uk, as they've always been using .uk, so should be excluded.