I’m working on a master’s in data science, and I have a course project on distributed data mining. We have the full history of logs from logs.tf to use for our analysis. What questions does the TF2 community have that big data mining could solve?
Here are some ideas we have so far:
-Finding cheaters with outlier analysis
-Heatmap of deaths on maps
-Metastatistics (how likely you are to win the round if you win the midfight, how likely you are to hold last with uber disad, etc)
-Tracking player improvement over time
-Future match prediction
These are our quick ideas, but we’re very interested in other ideas. Let us know what else you think of! And of course we wouldn't be doing all of this, we would likely focus on one or two areas. Give us new ideas!
Here’s some more technical stuff about the assignment if you’re interested-
Essentially we’re looking to take big data and gain insights from it using really powerful machines (a TB of RAM, 128 threads, 20 GPUs, etc). We’re not allowed to use neural networks (we’re taking that class at the same time), but all other forms of machine learning and data mining are allowed. If you have any ideas for machine learning or data mining methods that you think would be good for this data, hit us with those too!
And of course, huge shoutout to Zoob from logs.tf and his incredible work on that site