So...nobody else with an idea how one could access the QPOSCNT register in a faster way/a way that does not stall the CPU and influence (timer) interrupts?
It's somewhat amazing for me TI integrates a fast and easy to use quadrature decoder into AM3358 but reading the current decoder value is a bottle neck that makes eQEP nearly unusable for real-time applications.