We are looking for skilled Site Reliability Engineers to develop, maintain, and run the network services, storage layers, and configuration of a petabyte-scale AWS storage and Kafka streaming stack.
* Develop and maintain the operational configuration of a petabyte-scale storage and stream analysis service
* Develop metric collection, visualization, and reporting for operational intelligence using Prometheus, Grafana, and related tooling
* Work with Backend and Full Stack engineers to co-develop GraphQL and REST APIs that surface ChatLogs data for various users
* Work with Audio and Speech AI Engineers to accelerate development and deployment of heterogeneous analysis and training pipelines
To apply for this job, please visit itjobpro.com.