30/11/2023
Titan Technologies Limited ( thetitantech.com ) is a software company based in Bangladesh, a sister concern of ๐๐ข๐ฌ๐ฌ๐๐จ๐ฆ ๐๐๐ ( ๐๐ข๐ฌ๐ฌ๐๐จ๐ฆ.๐๐จ๐ฆ ), California, USA. We started our Bangladesh operation in 2020, during the pandemic, and have had steady success in providing innovative, cutting-edge tech products and services to companies worldwide. We specialize in Web, Mobile, and AI-based SaaS products and want to act as a catalyst for cultivating homegrown talent.
Job Title: System Monitoring & Incident Management Specialist
JOB SUMMARY:
The Sr. BSA - DevOps, Major Incident Management, Site Reliability Engineer will assume a key operational role within the Connected Car System Operation program. This individual will have specific assigned responsibilities in support of the Product Development program, Production QA program, and Production Operations, as well as administering the established Issue & Incident Management program.
JOB RESPONSIBILITIES:
-Connected Car feature development support for new model vehicles and technology platforms
-Production issue triage and tracking, preliminary troubleshooting activities, ticket creation and maintenance, ticket assignments, as well as vendor follow-up
-Working with the appropriate internal and external stakeholders to satisfy customer requests and provide resolution strategies for incoming issues
-Identify root causes of technical issues in production and out-of-box failures, and produce the RCA document. Coordinate root cause analysis and problem management, facilitate recurring problem management meetings with management stakeholders, and drive trend analysis and strategic resolution
-Evaluate business and system requirements, ensure technical feasibility, and develop, fix, and validate the Connected Car system.
-Maintain knowledge and technical expertise of the current production system and of best-practice tools and techniques, and implement them in a reasonable and responsible manner.
-Identify issues and risks as they relate to the Issue/Incident Management program and take escalation actions as necessary
-Be available according to a shared coverage schedule (i.e. 24 x 7) to address P1 and P2 (Emergency) incidents as needed, including monitoring production systems and triaging and mitigating alerts.
-Design and develop solutions to fix reported defects, and stabilize new products in Production.
-Interact with project teams on new product development, sharing insights from technical issues in previous products.
-Produce clean, unit-tested, and refactored code when required.
-Create system & technical design for downstream technical teams (e.g. Application server, Database etc.)
-Establish development environment and development guidelines (design & code review checklists, design principles etc.)
-Prepare detailed specifications from which programs will be written, designed, coded, tested and debugged.
-Develop RESTful (Representational State Transfer) web-services that can support high-volume transactions.
-Work with multiple stakeholders to analyze requirements, clarify design dependencies, create test plans, and support functional and non-functional requirements and software documentation in mathematical or diagram form to ensure technical accuracy, compliance and completeness.
-Support testing efforts by engaging in troubleshooting and providing solutions to issues.
REQUIRED SKILLS, EXPERIENCE & EDUCATION:
-B.S. required, B.S.E.E. or C.S. preferred
-7+ years of Operations or Production Monitoring experience, with a minimum of 5 years in a similar role supporting and interfacing with management.
-Experience in supporting development and design environments with Java, J2EE/.NET and database technologies preferred
-Experience working on complex technical projects in a multi-vendor project environment.
-Incident and Issue Management experience required.
-Problem Management and Root Cause Analysis background required
-Experience using software development life cycle (SDLC) to deliver solutions and experience with developing web applications deployed in multi-tiered environments
-Experience writing & maintaining technical designs, functional designs, and integration documentation using either UML or EA
-Anything from a working understanding to hands-on development experience with web applications deployed in complex multi-tiered environments, including:
-Proficiency in deploying and monitoring applications on J2EE Web/Application servers such as JBoss, WebLogic, and Tomcat preferred.
-Hands-on experience with databases such as Oracle, SQL Server, and MySQL.
-Ability to write complex DB queries and perform SQL analysis and tuning preferred.
-Experience working with application & engineering teams to develop requirements that define monitoring and interpret alerting, notification, and escalation requirements for managing the end-user experience, assist with fault isolation, and deliver proactive environment health management analysis and reporting in compliance with published SLAs
-Knowledge of SOA vs. microservices and their appropriate application is desired.
-Strong understanding of Object Oriented Analysis and Design (OOAD) concepts.
-A well-developed understanding of the theory and principles of operation of the internet and packet data protocols.
-Able to learn, or have pre-existing knowledge of, basic connected car architecture including systems integration, interface details, in-vehicle services/functions in order to understand the platform and environment for effective triage and troubleshooting capabilities.
-Strong coordination, communication, and networking skills
-Maintains professional demeanor
-Ability to work as part of a team and display excellent communication and interpersonal skills
-Capability to work under pressure and to tight timescales
-Able to multi-task and maintain composure
-Experience with SQL, Dynatrace, ELK, xMatters, Postman, JMeter, Status Page, or other useful tools for investigating and assessing impact during incidents
Must Haves:
-Domain expertise in a particular class of incident, such as Call Routing Emergency Services, Telematics, or Automotive, with an interest in working more broadly
-Experience being a part of an on-call rotation (24x7)
-Experience writing broad user-facing communications (e.g. status pages, tweets) and/or targeted communications (e.g. direct emails, support ticket responses)
Titan Technologies Ltd is an equal opportunity employer. We encourage applications from candidates of all backgrounds.
Salary: Negotiable
Working Time: 8:00 AM - 8:00 PM (7 days) (Roster basis)
Location: Gulshan 1, Dhaka
Please send your CV to "career@thetitantech.com" with the subject "System Monitoring & Incident Management Specialist". Only selected candidates will be contacted for the interview phase.
Application Deadline: 12 December, 2023