#10 implemented capturing of enhanced information when a bad domain is detected.
We need to develop reporting around that - I currently have a dashboard but it's loosely put together and needs refining.
Commit b1911bf2 implemented a utility for viewing the generated screenshots, but again there's definitely some refinement possible
Activity
29-Mar-22 07:40
assigned to @btasker
03-Apr-22 10:21
mentioned in commit 58613d40600aeb67b12f7f4f420cb680230aeb3d
Message
Downsample bad-domains stats ready for reporting in websites/privacy-sensitive-analytics#12
03-Apr-22 10:25
We now capture stats about the number of bad domain requests.
Number of writes originating from a bad domain:
Number of unique bad domains observed in period
That ones actually currently probably less useful than it sounds. In most periods it'll be 1 or 2, and there's no reliable way to use it to extrapolate out to later periods (were the 3 in the next hour the same bad-domain or different ones).
If we want to be able to capture that, we'd need to write the actual domain in as a field or tag value. The problem with that is you either risk runaway cardinality, or accept that the data's not really downsampled (because we'd need to write a point per hit).
Will give that one some more thought
03-Apr-22 10:46
mentioned in commit 4ea7009281ef11d091d4cc099e652a3c8c9b6586
Message
Write a list of observed bad domains and associated pages into long term storage for websites/privacy-sensitive-analytics#12
We write these as a field value to ensure that a slew of dodgy domains doesn't impact our cardinality
03-Apr-22 10:58
We now write a list of the domains in, so I've added a graph to the dashboard that shows the number of unique bad domains in the time period - it uses better Flux than the downsampling version above
03-Apr-22 11:02
mentioned in commit 11820d97dd320733791e45cf90836c5471c51efd
Message
Update historic dashboard to give bad domain stats for websites/privacy-sensitive-analytics#12
03-Apr-22 11:02
We have reporting in place
03-Apr-22 11:03
I've since opted to remove the screenshot functionality (#14) so there's nothing extra to do here.