Using predictive modeling as a hammer when the nail needs more thinking
The business problem is to put a lifeguard station on a beach to save some lives (i.e., find the best location for the lifeguard station). This is not really a predictive modeling problem. But that’s the hammer our data scientists have and they have access to fancy libraries. There is also some historical data: swimmers rescued and drowned at other beaches. It all checks out. Resistance to ๐ฑ๐ช๐ฑ ๐ช๐ฏ๐ด๐ต๐ข๐ญ๐ญ ๐ฑ๐ณ๐ฐ๐ฑ๐ฉ๐ฆ๐ต is futile.
Transforming the problem into an objective function could have signaled that this is an optimization problem (a prescriptive modeling problem), but that step was skipped. In the picture shown, we may need a solution:
– minimizes distance => ๐ฆ๐ผ๐น๐๐ฒ๐ฑ ๐๐๐ถ๐ป๐ด ๐ฝ๐ถ๐ฝ ๐ถ๐ป๐๐๐ฎ๐น๐น ๐ณ๐ฎ๐ป๐ฐ๐_๐น๐ถ๐ฏ๐ฟ๐ฎ๐ฟ๐
while also…
– minimizing time => ๐ง๐ต๐ฒ ๐ฑ๐ผ๐บ๐ฎ๐ถ๐ป ๐ฒ๐
๐ฝ๐ฒ๐ฟ๐ ๐ฒ๐ป๐๐ฒ๐ฟ๐ ๐๐ต๐ฒ ๐ฟ๐ผ๐ผ๐บ
– minimizing swimming => ๐ง๐ต๐ฒ ๐น๐ฎ๐ฏ๐ผ๐ฟ ๐๐ป๐ถ๐ผ๐ป ๐ถ๐ป๐๐ฒ๐ฟ๐๐ฒ๐ป๐ฒ๐
– minimizing time to ice cream => ๐ง๐ต๐ฒ ๐ฒ๐
๐ฒ๐ฐ๐๐๐ถ๐๐ฒ ๐น๐ฒ๐ฎ๐ฑ๐ฒ๐ฟ๐๐ต๐ถ๐ฝ ๐๐๐ฒ๐ฝ๐ ๐ถ๐ป
– [not shown] minimizing walking on sand => ๐ง๐ต๐ฒ ๐๐ฒ๐ฝ๐ฎ๐ฟ๐๐บ๐ฒ๐ป๐ ๐ผ๐ณ ๐๐ฎ๐ฏ๐ผ๐ฟ ๐ฟ๐ฒ๐พ๐๐ถ๐ฟ๐ฒ๐บ๐ฒ๐ป๐
and hopefully not…
– maximizing time => ๐ ๐ท๐๐ป๐ถ๐ผ๐ฟ ๐ฑ๐ฎ๐๐ฎ ๐๐ฐ๐ถ๐ฒ๐ป๐๐ถ๐๐ ๐๐ผ๐น๐๐ฒ๐ ๐๐ต๐ฒ ๐ฝ๐ฟ๐ผ๐ฏ๐น๐ฒ๐บ
So, the ideal solution requires more thinking about the problem. For example, maximizing the number of lives saved may actually require constraints on how to minimize time so that lifeguards don’t risk their lives during the rescue.
The law of the instrument works a little too well in predictive modeling (and more generally in machine learning). Objective functions are often lost in translation when they should be an explicit step in the modeling process. Best practice tends to favor performance metrics, even though achieving the highest performance on the wrong function is clearly useless (and sometimes detrimental).
More focus on objective functions and less obsession with “better performance” may be what we need. This would underline the importance of problem formulation and domain knowledge, and undermine the ๐ฑ๐ช๐ฑ ๐ช๐ฏ๐ด๐ต๐ข๐ญ๐ญ ๐ฑ๐ณ๐ฐ๐ฑ๐ฉ๐ฆ๐ต solution.
A combination of Warren Powell‘s writing and the accompanying xkcd comic inspired this post (courtesy of xkcd.com).