Case study: polluted reports shows how system can be polluted with dummy data. Saving data (even HTTP referer) without validation can contaminate system as well: The results show astonishing 28KB for storing single value: Next time you see Analytics shards worth 600 GB – recall this post.
Can “site visit frequency from specific place (or better, certain company office)” be just a query away? The needed analytics data is already collected by Sitecore, hence data mining could roughly be: Figure out area postal code (or reverse it by IP using any reverse IP lookup) Find all contacts that have the same detailsContinue reading “How often the site is visited from specific place?”
Agenda Analytics reports have suspicious statistics with lower conversion rates compared to other systems. Can we find out why? It seem that healthy data is diluted with junk/empty interactions with no value. We assume robot/crawlers activity gets recorded. Is there any OOB protection in Sitecore? Filter out robots by user agents Sitecore blacklists robots viaContinue reading “Case study: polluted reports”