On the Data Engineering Podcast episode I appeared on recently I received a question I hadn’t really considered very hard.
Is it wise to set up servers where all the processes are configured to be storage processes?
The rationale is if a storage process fails there is no need to go through a recovery. The potential downside is you might require more servers than before to achieve the same level of fault tolerance.
Is this a common configuration? I had never heard it before and I assume someone here has at least tried it at some point if it is not obviously flawed.