This week in Mozilla RelEng – February 14th, 2014
Highlights:
- Aki spent most of his week braindumping Mozharness and VCS Sync knowledge before he goes on PTO, which will help to ensure we don't drop the ball on anything while he's gone.
- Pete migrated repositories from the legacy vcs2vcs system to the much more stable VCS Sync system. Moving these reduces the chance of a tree closure.
- Geoff turned on full ADB logcat logs for many Android tests. This should help significantly when debugging test failures on RelEng machines.
- Glandium enabled sccache for some try builds, which is expected to improve build times by having a much better cache hit rate than ccache.
- Many of us started early work on jacuzzis, which will also improve build times when we're ready to deploy them more widely.
- Rail increased our usage of AWS spot instances, which lowers our AWS bill with no impact on turnaround times.
- Mike thinks he found the reason why "hg purge" fails in some circumstances. Once addressed, we can spend less time recreating full source directories, especially during try builds.
- Armen enabled testing of b2g reftests on EC2 machines. Once problems with these are sorted out, we can move all of these tests to EC2, which allows us to achieve better reliability and scale for them.
- Buildduty
- add nagios bug queue checks for buildduty and loan requests components
- please upload new talos.zip to capture talos changes for android
- Add a mutant version of sixgill to the tooltool server
- Deploy Android 2.3 AVD definitions to Ash for testing
- nightly builds failing because of switch to ftp-ssl
- General Automation
- Bump mock timeout
- Create two S3 buckets and make them available from build slaves
- split Android 4.0 robocop (rc) into 5 chunks
- limit pvtbuilds uploads
- self-serve agent on bm66 eating jobs
- Mozharness' vcs_checkout() should attempt repo cloning more than once & output a TBPL compatible failure message
- Turn off all tests on UX project branch
- Windows "remoteFailed: [Failure instance: Traceback (failure with no frames): : Connection to the other side was lost in a non-clean fashion."
- Add s3:PutObjectAcl access to shared cache buckets from build slaves
- gaia-ui tests need to dump a stack when the process crashes
- Sometimes building files more than once on mac and linux
- Disable non-unified builds on Aurora
- Intermittent gaia-ui-test failures with "Unable to purge /builds/slave/talos-slave/test/gaia!" (or "Unable to purge /builds/slave/test/gaia!")
- Do not use spot instances for some builders
- Tracking bug for 3-feb-2014 migration work
- Flags passed to jit-test from mozharness should match flags in make check.
- Remove configs for nanojit-central
- final verification should report remote IP addresses
- Update signing server whitelists
- b2g 1.3t branch support
- Do periodic PGO builds on the UX branch
- Allow test slaves to save and upload files somewhere, even from try
- review nagios alerts for builds-running, builds-pending
- Temporarily revert the change to m3.medium AWS instances to see if they are behind the recent increase in test timeouts
- Tracking bug for 9-dec-2013 migration work
- Switch update server for Buri from OTA to FOTA by using the solution seen in bug 935059
- Reset Fig, Larch and Elm configurations
- [Tracking bug] automation support for B2G v1.3.0
- Loan Requests
- Need linux64 test slave to debug 959752
- Slave loan request for a talos-mtnlion-r5 machine
- loan request for graydon [Ubuntu 64]
- I need 32 && 64 bit ubuntu test slaves setup just like the ones that report to tbpl
- Please assign a Linux32 test machine to me
- Other
- Halting on failure while running ['unzip', '-q', '-o', '/builds/slave/test/build/b2g-22.0a1.en-US.android-arm.tests.zip']
- reduce ebs usage on try nodes
- Platform Support
- Log # network bytes transmitted/received to Graphite
- Give graydon the ability to test Android 2.3 on an emulator work live in Ash
- AWS machines should run b2g emulator reftests with GALLIUM_DRIVER=softpipe
- Run Windows 8 unit tests on Date branch
- Delete dead code in tools repo
- Releases
- tracking bug for build and release of Firefox and Fennec 28.0b1
- Add SeaMonkey 2.25 Beta 1 to bouncer
- outdated link: releases.mozilla.org/pub/mozilla.org/thunderbird/releases/latest
- Releases: Custom Builds
- [Partner] Yahoo configs for 27 which includes new toolbar version
- Clean up MSN bundles and restore MSN add-on to the correct URL
- Repos and Hooks
- Tools
- buildapi has wrong timestamps in json output
- high pending for try linux-hp builds. Are we unintentionally ignoring builds with aws_watch_pending
- fix backwards logic in slaverebooter
- add esr24 relbranches to gecko-dev
- Add cppunittests as option in TryChooser
- buildapi/recent/ returns start and end times as a unix timestamp in _pacific_ time
- update buildername regexs for Android 2.3 Emulator
- Report Bug # in AWS loan report
- slaveapi still files IT bugs for some slaves that aren't actually down
- slave health's recent job start and end times are wrong
- slave rebooter doesn't reboot slaves when graceful shutdown fails
- slaveapi and/or slaverebooter need to be adjusted to deal with new buildapi times
- Balrog: Backend
- API for manipulating balrog rules
- send cef events to syslog's local4
- Balrog shouldn't serve updates to older builds
- Buildduty
- Add swap to linux build machines with
- Increase AWS testing limit
- Report impaired AWS non slave machines
- General Automation
- Intermittent "BaseException: Failed to connect to SUT Agent and retrieve the device root."
- Self-serve should be able to request arbitrary builds on a push (not just retriggers or complete sets of dep/PGO/Nightly builds)
- Provision enough in-house master capacity
- aws sanity check shouldn't report instances that are actively doing work as long running
- Do we need desktop Firefox nightlies on the b2g26 (v1.2) branch anymore?
- keep buildbot master twistd logs longer
- Create IAM roles for EC2 instances
- Rooting analysis mozconfig should be in the tree
- use tbirdbld account to submit thunderbird data to balrog
- [Meta] Some "Android 4.0 debug" tests fail
- switch routing to hg.m.o to use public internet
- b2g build improvements
- Please schedule mozbase unit tests on Cedar
- [tracker] run Android 2.3 test jobs on EC2
- Run mozbase unit tests from test package
- aws_watch_pending.py should use jacuzzis
- Make Pine use mozharness production & limit the jobs run per push
- fx desktop builds in mozharness
- Move Firefox Desktop repacks to use mozharness
- ensure the timezone and time are set properly on tegras (and other devices)
- Use spot instances for regular builds
- Install ant on builders
- windows ix test machines don't always reboot
- Make B2G device builds periodic
- Add Linux32 debug SpiderMonkey ARM simulator build
- Implement ghetto "gaia-try" by allowing test jobs to operate on arbitrary gaia.json
- Turn off PGO, talos and debug builds on UX project branch
- spidermonkey_build.py looks for gcc in two different places
- Show SM(Hf) builds on mozilla-aurora, mozilla-beta, and mozilla-release
- Remove 'update_files' logic from B2G unittest mozharness scripts
- [tarako][build]create "tarako" build
- Build the Gecko SDK from Firefox, rather than XULRunner
- move off of dev-master01
- Update b2g_bumper.py with a git dict
- Schedule JB emulator builds and tests on cedar
- Intermittent Linux spot builder "command timed out: 1800 seconds without output, attempting to kill" while trying to install mock
- Run jit-tests from test package
- Use mach to invoke printconfigsetting.py
- Loan Requests
- Please loan dminor Android 2.3 Emulator test instance
- Loan glandium a m3.medium test slave
- Need a win32 slave to debug Lightning pymake issues
- Please loan dminor instance to build Android 2.3 Emulator
- Slave loan request for a tst-linux64-ec2 machine
- Linux 64-bit test slave for bug 926264
- Other
- stage NFS volume about to run out of space
- selfserve agent drops requests
- [tracker] Machine move SCL1 -> SCL3
- [tracking] infrastructure improvements
- s/m1.large/m3.large/
- Platform Support
- [tracker] Move away from the rev3 minis
- Setup in-house buildbot masters for remaining in-house testers
- signing win64 builds is busted
- Windows slaves often get permission denied errors while rm'ing files
- Deploy ndk-stack on foopies
- Re-allocate tegras that were on decommissioned foopy118
- release-automation
- Create SSL products in bouncer as part of release automation
- Figure out how to offer release build to beta users
- Repos and Hooks
- Tools