We've had a mix of ~40 t2.u & c4.l instances running for a year with no downtime. Our i3.4xl has fully borked twice (memorable when we lose the ephemeral drives and need to reconstitute the analytics data).
Though it will be much more expensive and less performant, we're moving the system to an RDB-backed c4 soon for reliability, the people time to recover is too expensive.
Anecdata:
We've had a mix of ~40 t2.u & c4.l instances running for a year with no downtime. Our i3.4xl has fully borked twice (memorable when we lose the ephemeral drives and need to reconstitute the analytics data).
Though it will be much more expensive and less performant, we're moving the system to an RDB-backed c4 soon for reliability, the people time to recover is too expensive.