IT Job Pro

Lead Associate – Full Stack Developer (Site Reliability Engineering, SRE)

Fannie Mae – Reston, VA

Company Description

At Fannie Mae, futures are made. The inspiring work we do makes an affordable home a reality and a difference in the lives of Americans. Every day offers compelling opportunities to modernize the nation's housing finance system while being part of an inclusive team using new, emerging technologies. Here, you will help lead our industry forward, enhance your technical expertise, and make your career.

Job Description

As a valued colleague on our team, you will act as a team lead in designing, producing, testing, or implementing software, technology, or processes, and will lead processes for creating and maintaining IT architecture, large-scale data stores, and cloud-based systems.

You will apply your expertise in software and systems engineering to ensure that both our internally critical and externally visible systems meet the appropriate performance needs of our users. You will serve as a champion of service availability, efficiency, automation, monitoring, and capacity management. Specifically, you will leverage your skills and experience in Amazon Web Services, software development with Java and/or Python, customization in Splunk and/or Dynatrace, and automation in Selenium and/or Blue Prism (among others) to enable increased feature velocity and continuous improvement.


The Site Reliability Engineering (SRE) Lead Associate role will offer you the flexibility to make each day your own, while working alongside people who care, so that you can deliver on the following responsibilities:

  • Independently determine the needs of the customer and create solution frameworks.
  • Design and develop moderately complex software solutions to meet needs.
  • Use a process-driven approach in designing and developing solutions.
  • Implement new software technology and coordinate end-to-end tasks across the team.
  • May maintain or oversee the maintenance of existing software.



Minimum Required Experiences

  • 4+ years

Desired Experiences

  • Bachelor's degree or equivalent in Computer Science, Management Information Systems (MIS), Systems Engineering, or a related field
  • 10+ years of full-stack engineering experience or experience in scripting for test automation
  • Experience in conducting failover and failback exercises and blue-green deployments
  • Deep knowledge of ServiceNow, Splunk, and Dynatrace (good to have)
  • Cloud Developer or Architect Certification
  • Experience with Scaled Agile Framework (SAFe) and Jira / Confluence
  • Experience in carrying out disaster recovery plans and executing failover tests
  • AIOps: BigPanda, Moogsoft, Artificial Intelligence (AI) and Machine Learning (ML) frameworks
  • Understanding of Java performance monitors (JVM, GC, Heap Size, Message Broker)
  • Experience creating JMeter and Selenium scripts


  • Design and implement full-stack, Java-based custom tooling solutions aimed at automating and optimizing away toil.
  • Instantiate the Site Reliability Engineering practice at Fannie Mae, igniting the practice, principles, and culture and leading by example. Assist in training skilled peers and partnering with peer platform-embedded SRE teams.
  • Introduce enterprise capabilities, tools, and innovation that improve availability in a multi-cloud ecosystem by evolving observability, monitoring, logging, dashboard visualization, CI/CD integration, and continuous testing (performance, smoke, regression, functional, chaos); introduce continuous improvement, standardization and automation, and capabilities to conduct destructive and resiliency testing.
  • Introduce self-healing and autonomic capabilities that solve complex operational and systemic issues with precision, including building and training models, automating cognitive processes, and leveraging cutting-edge technologies to improve the availability of the products we provide to customers.
  • Automate key SRE metrics and IT Service Operations processes, including customer impact, % availability of critical business flows, SLO/SLI adherence, and error budget; automate the incident process for IT Service Operations through data integration with unified communications and alerting/notification systems.
  • Share support responsibilities for critical applications and customer journeys onboarded to SRE, including remediation of issues through Agile
  • Excellent problem-solving skills and proactivity in resolving issues / blockers
  • Excellent verbal / written communication skills, relationship management skills, and ability to collaborate with multiple stakeholders
  • Eagerness to learn and ability to work independently with minimal guidance


Proven Technical Expertise with one or more of the following:

  • Software Development: Java/J2EE, REST, microservices, messaging technologies like Kafka or MQ, JavaScript frameworks like React or Bootstrap, SQL
  • OS and Platform: Linux; cloud technologies (AWS, GCP, or Azure); container platforms
  • CI/CD and Automation: Jenkins, GitLab, SonarQube, Artifactory
  • Observability and AIOps: Grafana, Prometheus, ELK or Splunk, Jaeger or Zipkin, AppDynamics, Dynatrace, or similar
  • Testing: Gremlin, Chaos Monkey, Chaos Toolkit, JMeter, BlazeMeter, LoadRunner

Additional Information

Job REF ID: REF10922B

The future is what you make it to be. Discover compelling opportunities at

Fannie Mae is an Equal Opportunity Employer, which means we are committed to fostering a diverse and inclusive workplace. All qualified applicants will receive consideration for employment without regard to race, religion, national origin, gender, gender identity, sexual orientation, personal appearance, protected veteran status, disability, age, or other legally protected status. For individuals with disabilities who would like to request an accommodation in the application process, email us at

To apply for this job please visit