Site Reliability Engineer, Observability
Company: Intradiem
Location: Alpharetta
Posted on: May 7, 2022
Job Description:
Job Title: - - - - - - -Site Reliability Engineer,
ObservabilityLocation: - - - - - - -Remote/VirtualReports to: - - -
- -VP of DevSecOps -Have you ever sat on hold waiting for customer
service and wondered why this was still the norm in 2022? Us too.
Intradiem's intelligent automation solution for customer service
teams is reinventing customer service for everyone.Who We
AreIntradiem is a technology company on a mission to reinvent
customer service through automation. -What We DoWe develop
innovative, AI-powered Intelligent Automation solutions for contact
center and back-office teams. Our solutions currently support
hundreds of thousands of customer service agents for brand-name
organizations, powering hundreds of millions of automated actions
saving customers tens of millions each year.How We WorkWe take a
"problem-out" approach, asking customers to help us understand
their business problems, exploring potential solutions together,
incorporating their feedback, and releasing solutions that solve
those problems.Our CultureWe take a "people-first" approach,
treating employees, customers and each other with the dignity and
respect we all deserve. Intradiem employees enjoy a family-first
culture, transparent leadership, and unfettered growth
opportunities. -Our ValuesWe believe in service, encouraging our
employees to contribute time and energy to causes that help improve
the people and communities in which they live and work. We are
guided by three core values:
- Servant's Heart-caring enough about other people to understand
what their problems are and placing the needs of colleagues,
customers, and others over personal objectives.
- Craftsman's Attitude-taking pride in the work we do and
creating solutions that really solve the problem at hand (and
trying again if the first attempt doesn't do the trick).
- Revolutionary Spirit-leaving the world a better place than it
was when we found it, and doing things we would be proud to brag
about to our grandchildren. -Your RoleThis Site Reliability
Engineer will have a strong understanding of large-scale computing
solutions. You have experience working as a DevSecOps or Site
Reliability Engineer in a scaled cloud environment and have
implemented automated solutions across a variety of applications
and systems. You enjoy writing code and creating automation to
manage your services.Your Responsibilities
- Identify monitoring/alerting/observability needs, and influence
team culture to bring this rigor into the daily development
cycle
- Collaborate closely with Development and SRE teams to implement
o11y best practices
- Design and build new features for infrastructure and services
observability. Dive into new technologies and figure out how to
best monitor them.
- Manage, maintain, and scale the infrastructure responsible for
telemetry frameworks used throughout Intradiem's cloud-based
application services to capture, transport, store and analyze the
telemetry data.
- Develop solutions to implement the SLO/SLI requirements,
including visualization of the monitoring dashboard
- Reduce toil by creating observability automation that can be
reused across our teams -Your Background
- Bachelor's Degree in Computer Science or related field, or 5+
years relevant work experience
- 5+ years of software engineering experience -
- 5+ years of Linux experience
- 5+ years of experience with scripting languages such as Bash,
Python, Shell, or JavaScript
- 3+ years of operating production systems on a major cloud
platform (AWS, GCP, Azure)
- 3+ years of programming experience & proficiency in Java and at
least one other high-level programming language such as Python,
C++, Go, C#
- Experience with implementing and using observability tools such
as OpenTelemetry, Dynatrace, AIOps, Prometheus, Grafana, Elastic
Search, Splunk in measuring and optimizing around the Four Golden
Signals
- Experience with version control or source code repositories and
CI/CD tools: Jenkins, Git (Bitbucket), Nexus, Maven, etc.
Keywords: Intradiem, Alpharetta , Site Reliability Engineer, Observability, Engineering , Alpharetta, Georgia
Didn't find what you're looking for? Search again!
Loading more jobs...