Go Back  PPRuNe Forums > Misc. Forums > Passengers & SLF (Self Loading Freight)
Reload this Page >

BA delays at LHR - Computer issue

Wikiposts
Search
Passengers & SLF (Self Loading Freight) If you are regularly a passenger on any airline then why not post your questions here?

BA delays at LHR - Computer issue

Thread Tools
 
Search this Thread
 
Old 28th May 2017, 05:25
  #121 (permalink)  
 
Join Date: Dec 2010
Location: SCAL
Posts: 116
Received 13 Likes on 5 Posts
Originally Posted by PAXboy
The six phases of a big project is a cynical take on the outcome of large projects:
In my (38 years) experience most IT mega projects go in just two phases

1) Fire all the people who say it won't work

later

2) Fire all the people who said it would work
sherburn2LA is online now  
Old 28th May 2017, 05:36
  #122 (permalink)  
 
Join Date: Jul 2002
Location: In the sticks
Posts: 9,847
Likes: 0
Received 0 Likes on 0 Posts
I used to install UPS's (uninterruptible power supplies) as part of my job that guaranteed power for a set time. Years later we would go back when they failed to operate as the batteries that were on a constant trickle charge had a limited life and that no one had thought about changing them.
LTNman is offline  
Old 28th May 2017, 05:36
  #123 (permalink)  
 
Join Date: Jan 2002
Location: Europe
Posts: 1,200
Received 0 Likes on 0 Posts
Nialler

prevents against large physical attacks. Logical ones are not prevented against.
I thought it is possible to avoid logical attacks by having the codes/software written by different people. So mistakes and logical bombs happen only in one system. I hope it makes sense.
Rwy in Sight is offline  
Old 28th May 2017, 05:43
  #124 (permalink)  
 
Join Date: May 2008
Location: Paris
Age: 60
Posts: 101
Likes: 0
Received 0 Likes on 0 Posts
Originally Posted by Rwy in Sight
I thought it is possible to avoid logical attacks by having the codes/software written by different people. So mistakes and logical bombs happen only in one system. I hope it makes sense.
For something such as a a database subsystem and its the associated application layer there will be many hundreds and potentially thousands of coders involved.
Nialler is offline  
Old 28th May 2017, 06:06
  #125 (permalink)  
 
Join Date: Aug 2004
Location: moon
Posts: 3,564
Received 89 Likes on 32 Posts
freehills:

Quote:
Originally Posted by Sunfish View Post
As aa general rule, if IT is critical to the business, then you don't outsource it.
Major airlines that don't outsource IT
Delta (and they messed up)
Air New Zealand

And, frankly, it is a silly rule. I don't see many companies building their own PC operating systems, despite how critical that is. For a parallel, engines are also critical to the airline business, but since the break up of United in the 1930's, no airline hasn't outsourced designing & building engines
you misunderstand. there are multiple reputable and competitive sources for PC's, operating systems, engines (new and overhauled), tyres, etc. of course you can outsource all that without losing control and generate efficiencies in the process.

however, there is only one BA, and one set of BA IT systems that are intricately and inextricably bound up with BA'S business strategy. The two are inseparable and complementary, you cannot change business strategy without modifying business rules that are implemented in the software, period.

if you lose detailed control of the strategic IT in your business, you risk exactly what BA are now experiencing, because your contractor cannot know in detail what your strategic priorities are as well as you do, and those priorities change monthly.

for example, consider what software changes might be required to implement a marketing idea to increase baggage allowances.
Sunfish is offline  
Old 28th May 2017, 06:12
  #126 (permalink)  
ImageGear
Guest
 
Posts: n/a
To IPL a mainframe shouldn't take more than five to ten minutes.
and this is totally unacceptable as BA have found to their severe cost. Whoever took the decision to implement a system where recovery is to IPL and blow the reservations system out of the water with no failover needs to take the long walk.

To call the latest crop of "big servers", mainframes, is blatant misrepresentation. True fault tolerant mainframes no longer really exist.

30 Years ago, I ran a demonstration in Paris for airline CIO's and IT Directors, of multi-host file sharing across duplicate mainframes, where it was proved that on catastrophically failing one mainframe, the end result was not one transaction being lost anywhere between the reservations terminal and the failing computer memory module. The impact on the reservations clerk was a "wait" screen that lasted around 40 seconds. The solution sold, but not to BA.

There is absolutely no excuse, 30 years later, for any airline reservations department to accept anything less.

Some very big heads must roll.

Imagegear.
 
Old 28th May 2017, 06:33
  #127 (permalink)  
 
Join Date: May 2008
Location: Paris
Age: 60
Posts: 101
Likes: 0
Received 0 Likes on 0 Posts
Originally Posted by ImageGear
and this is totally unacceptable as BA have found to their severe cost. Whoever took the decision to implement a system where recovery is to IPL and blow the reservations system out of the water with no failover needs to take the long walk.

To call the latest crop of "big servers", mainframes, is blatant misrepresentation. True fault tolerant mainframes no longer really exist.

30 Years ago, I ran a demonstration in Paris of multi-host file sharing across duplicate mainframes, where it was proved that on catastrophically failing one mainframe, the end result was not one transaction being lost anywhere between the reservations terminal and the failing computer memory module. The impact on the reservations clerk was a "wait" screen that lasted around 40 seconds. The solution sold, but not to BA.

There is absolutely no excuse, 30 years later, for any airline reservations department to accept anything less.

Some very big heads must roll.

Imagegear.
You misunderstood me. My entire career has been working on mainframes. The type of fail over capabilities you describe are not quite thirty years old, but you are correct that not simply sysplex, but geographically dispersed sysplex is possible. Not just possible but standard. My clients all have datacentres at sites which h are remote from each other and are cycled on a scheduled basis. OK, I'm an old it head, but I'm sure that pilots will Nod at the concept of double or triple redundancy. Their lives depend on it. In my case it is just my job.

My fear always is that a single system failure might not be restricted or contained when it is a logical or intrinsic programmer error which with the cold logic of object code propagates through the redundant systems also. The problem in your primary hydraulic system is not actually isolated because the same problem which led to its failure exists on the fallback.
Nialler is offline  
Old 28th May 2017, 06:57
  #128 (permalink)  
 
Join Date: May 2010
Location: UK
Posts: 56
Likes: 0
Received 0 Likes on 0 Posts
Why is everyone being so quick to accept the official line? A power supply failure taking down multiple systems across multiple data centers. Possible but it wouldn't be at the top of my list of likely explanations.
BetterByBoat is offline  
Old 28th May 2017, 07:19
  #129 (permalink)  
Thread Starter
 
Join Date: Apr 2010
Location: London
Posts: 7,072
Likes: 0
Received 0 Likes on 0 Posts
Exactly - and should also be (relatively) easy to fix........................

some sort of botched software upgrade is my guess -... but blaming the hardware is probably a first step to trying to save your job
Heathrow Harry is offline  
Old 28th May 2017, 07:27
  #130 (permalink)  
 
Join Date: Mar 2010
Location: The Home of the Gnomes
Posts: 412
Likes: 0
Received 3 Likes on 2 Posts
Some very big heads must roll.
This is BA. Of course they won't.
Tay Cough is offline  
Old 28th May 2017, 07:38
  #131 (permalink)  
 
Join Date: Apr 2006
Location: ESSL
Age: 79
Posts: 61
Likes: 0
Received 0 Likes on 0 Posts
Originally Posted by Heathrow Harry
Exactly - and should also be (relatively) easy to fix........................

some sort of botched software upgrade is my guess -... but blaming the hardware is probably a first step to trying to save your job
Time to back to pen and paper. Back in the day (1970) in the brand new high tech terminal 1 the only computer we had was the Solaris information board and that broke down often. Paper pax manifest and hand written load sheets.
FlightCosting is offline  
Old 28th May 2017, 07:45
  #132 (permalink)  
 
Join Date: Aug 2007
Location: West London, UK
Posts: 12
Likes: 0
Received 0 Likes on 0 Posts
I seem to remember that two or three years ago BA were putting Navitaire into LHR, LGW, and LCY as part of what I was told was a "backup strategy" (I presume they meant a fallback strategy, but there you go). My involvement with UK airline IT ended shortly afterwards - does anyone know if they ever did it?
xs-baggage is offline  
Old 28th May 2017, 08:22
  #133 (permalink)  
 
Join Date: Mar 2002
Location: London, UK
Posts: 437
Likes: 0
Received 0 Likes on 0 Posts
Originally Posted by sherburn2LA
In my (38 years) experience most IT mega projects go in just two phases

1) Fire all the people who say it won't work

later

2) Fire all the people who said it would work
Typo. You meant:-

2) *Promote* all the people who said it would work...

["*We* know it will work, it's the silly IT nerds who couldn't do it. Sack them, promote us for having the brilliant idea in the first place"].
RomeoTangoFoxtrotMike is offline  
Old 28th May 2017, 08:28
  #134 (permalink)  
 
Join Date: Apr 2007
Location: UK
Posts: 38
Likes: 0
Received 0 Likes on 0 Posts
I worked 40 years in safety and reliability in transportation industries, laterally with ATC. I could tell many stories of power supply disasters in so called very resistant systems. In the Far East I was doing a project in a road control centre. I asked them about power supply resilience and was assured they had all the batteries and generators to make it bulletproof. THE VERY NEXT DAY on entering the centre there were worried looking managers and a long extension lead across the floor leading the back of a server rack. “We had a power failure and had to do that to get it working again”.

In the same country the main and backup power supply went through the same circuit breaker which failed taking down a complete train line.

In UK ATC the last serious power outage was 15-20 years ago. Since then NATS has vastly improved the power systems so, whilst nothing is impossible, I’d be extremely surprised if power was the cause. If it did happen the most likely cause would be maintainer error rather than a straight hardware or software failure in the power supplies. Many of NAT’s recent very occasional issues have been due to software upgrades not going to plan but they are nearly always recoverable to the backup system or old software in minutes. The up to hours interruption to some traffic flow is only due to the time to rebuild the traffic flow to one that works safely; in an airport like Heathrow, as it’s working at >95% capacity, a small blip can’t be recovered in minutes. NATS does its software upgrades only in quiet periods and would never chose a May Bank Holiday w/e to do it. Unlike an airline whose planes are on the ground in incidents like current one, so safe, ATC has many planes in the sky so has to be much more careful.
fchan is offline  
Old 28th May 2017, 08:33
  #135 (permalink)  
 
Join Date: Mar 2002
Location: London, UK
Posts: 437
Likes: 0
Received 0 Likes on 0 Posts
Originally Posted by BetterByBoat
Why is everyone being so quick to accept the official line? A power supply failure taking down multiple systems across multiple data centers. Possible but it wouldn't be at the top of my list of likely explanations.
I can see how a "power failure" in a specific subsystem, the consequences of which were neither fully thought through (or were ignored), the response to which was mishandled, could rapidly spiral out of control. It's probably at this point that having outsourced IT half a world away really began to bite...
RomeoTangoFoxtrotMike is offline  
Old 28th May 2017, 08:39
  #136 (permalink)  
 
Join Date: Nov 2000
Location: Cambridge, England, EU
Posts: 3,443
Likes: 0
Received 1 Like on 1 Post
Originally Posted by LTNman
I used to install UPS's (uninterruptible power supplies) as part of my job that guaranteed power for a set time. Years later we would go back when they failed to operate as the batteries that were on a constant trickle charge had a limited life and that no one had thought about changing them.
????? I don't see what use a UPS is if it doesn't monitor itself, surely it will have reported that it needed a new battery even if the bureaucratic systems for maintaining it failed???
Gertrude the Wombat is offline  
Old 28th May 2017, 08:43
  #137 (permalink)  
 
Join Date: Jan 2008
Location: The Smaller Antipode
Age: 89
Posts: 31
Received 17 Likes on 10 Posts
A million years ago - circa 1970's - BOAC sent us home from New York as passengers, unscheduled - pick up your tickets at the airport. On arrival JFK the check in girl advised us that we couldn't travel as "the system was down" and she couldn't print our tickets. The Flt. Eng. handed her his pen. We flew home.

It's called progress.
ExSp33db1rd is offline  
Old 28th May 2017, 08:49
  #138 (permalink)  
 
Join Date: May 2010
Location: liverpool
Posts: 15
Likes: 0
Received 0 Likes on 0 Posts
There's quite a gem allegedly from the great Alex himself been leaked on flyertalk, apparently from their internal network, what do they call it, yammer?

Could have joined BA a few years back, thank god I didn't. Great company going down the pan.
coalencanth is offline  
Old 28th May 2017, 08:55
  #139 (permalink)  
 
Join Date: Sep 2007
Location: Durham
Age: 79
Posts: 302
Likes: 0
Received 0 Likes on 0 Posts
Originally Posted by FlightCosting
Time to back to pen and paper. Back in the day (1970) in the brand new high tech terminal 1 the only computer we had was the Solaris information board and that broke down often. Paper pax manifest and hand written load sheets.
Pen and paper with some Blu Tack would have given passengers some kind of information in the terminals, however that would have needed someone with common sense and not a keyboard stuck to their hands.
oldart is offline  
Old 28th May 2017, 09:05
  #140 (permalink)  
 
Join Date: Jan 2002
Location: Europe
Posts: 1,200
Received 0 Likes on 0 Posts
Great company going down the pan.
It seems they get the best in Europe to recover from severe schedule disruption.
Rwy in Sight is offline  


Contact Us - Archive - Advertising - Cookie Policy - Privacy Statement - Terms of Service

Copyright © 2024 MH Sub I, LLC dba Internet Brands. All rights reserved. Use of this site indicates your consent to the Terms of Use.