Rejoining/recovery of galera cluster fails
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
OpenStack-Ansible | Fix Released | High | Jesse Pretorius |
Liberty | Fix Committed | Undecided | Darren Birkett |
Mitaka | Fix Committed | Undecided | Jesse Pretorius |
Trunk | Fix Released | High | Jesse Pretorius |
Bug Description
An infra node was down for about 2 days due to a hardware failure. When the node is brought back up, it fails to rejoin the cluster with the errors shown in [1]. This appears to be a bug in percona-
[0]
https:/
[1]
InnoDB: Starting an apply batch of log records to the database...
InnoDB: Progress in percent: 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99
2016-06-07 16:47:12 7efea54c1700 InnoDB: Assertion failure in thread 139632160020224 in file page0cur.cc line 931
InnoDB: We intentionally generate a memory trap.
InnoDB: Submit a detailed bug report to http://
InnoDB: If you get repeated assertion failures or crashes, even
InnoDB: immediately after the mysqld startup, there may be
InnoDB: corruption in the InnoDB tablespace. Please refer to
InnoDB: http://
InnoDB: about forcing recovery.
20:47:12 UTC - xtrabackup got signal 6 ;
This could be because you hit a bug or data is corrupted.
This error can also be caused by malfunctioning hardware.
We will try our best to scrape up some info that will hopefully help
diagnose the problem, but since we have already crashed,
something is definitely wrong and this may fail.
Thread pointer: 0x0
Attempting backtrace. You can use the following information to find out
where mysqld died. If you see no messages after this, something went
terribly wrong...
stack_bottom = 0 thread_stack 0x10000
innobackupex(
innobackupex(
/lib/x86_
/lib/x86_
/lib/x86_
innobackupex(
innobackupex() [0x68249c]
innobackupex(
innobackupex(
innobackupex(
innobackupex(
/lib/x86_
/lib/x86_
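When triaging a log like [1], it can help to pull out the assertion site and the fatal signal rather than reading the whole trace. The following is a minimal sketch, assuming the log lines have the same shape as the ones quoted above; the function name `summarize_crash` and the two regular expressions are illustrative and not part of any Percona or OpenStack-Ansible tooling.

```python
import re

# Matches the InnoDB assertion line as it appears in the log above.
ASSERTION_RE = re.compile(
    r"InnoDB: Assertion failure in thread (?P<thread>\d+) "
    r"in file (?P<file>\S+) line (?P<line>\d+)"
)
# Matches the fatal-signal line, e.g. "xtrabackup got signal 6 ;".
SIGNAL_RE = re.compile(r"(?P<program>\w+) got signal (?P<signal>\d+)")


def summarize_crash(log_text):
    """Return the assertion site and fatal signal found in log_text, if any.

    Hypothetical helper: returns an empty dict when neither line is present.
    """
    summary = {}
    assertion = ASSERTION_RE.search(log_text)
    if assertion:
        summary["thread"] = int(assertion.group("thread"))
        summary["file"] = assertion.group("file")
        summary["line"] = int(assertion.group("line"))
    signal = SIGNAL_RE.search(log_text)
    if signal:
        summary["program"] = signal.group("program")
        summary["signal"] = int(signal.group("signal"))
    return summary


# Two lines copied from the log excerpt in this report.
sample = (
    "2016-06-07 16:47:12 7efea54c1700 InnoDB: Assertion failure in thread "
    "139632160020224 in file page0cur.cc line 931\n"
    "20:47:12 UTC - xtrabackup got signal 6 ;\n"
)
print(summarize_crash(sample))
```

The summary only reports what the log states (source file, line, thread, program, signal); it does not attempt to diagnose the cause of the assertion.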
summary: - Rejoining galera cluster fails
+ Rejoining/recovery of galera cluster fails
Changed in openstack-ansible:
importance: Undecided → High
Changed in openstack-ansible:
status: New → In Progress
Changed in openstack-ansible:
assignee: Darren Birkett (darren-birkett) → Jesse Pretorius (jesse-pretorius)
Nobody who was active during this week's bug triage had the opportunity to test this bug. Therefore, all classification and work will be delayed until next week.
Sorry for the inconvenience.