Datacenter Hell
Posted by armwulf@reddit | talesfromtechsupport | View on Reddit | 65 comments
I am a fiber optic engineer. And this datacenter will be the death of me. Recognizing the big words will bore the crap out of most people, I'll give the abridged version.
This particular datacenter does not train or equip it's employees properly, and as a result, anytime they have a slightly complex problem, they make it worse by trying to fix it.
Then they call me for help, despite the fact I don't even work for them. I am their OSP vendor, meaning I fix problems outside and between buildings. Inside their own building is supposed to be their own problem.
I receive an email. This infamous datacenter tells me they're experiencing an "ORL" issue. That's optical return loss. It means the connection is too shiny and too much light is returning the way it came, backwards, instead of going forwards through the fiber optic cable to the destination.
I tell them it means their connectors are dirty and to clean them with proper cleaning supplies. Fun fact: they do not have proper cleaning supplies.
Days later they follow up, telling me the issue is now a 10dB degrade. That gives me pause. That could actually be an issue between the buildings I would be responsible for. A degrade means the light going from one building to the other is too dim when it arrives, some of it has been lost. 10dB is not a small amount, it means 90% of the light is missing and only 10% is getting to the destination.
I show up. I start my troubleshooting by asking a lot of questions. The answers I get confused me. The equipment readings I get confused me. Finally I realize what's happened.
On a previous trip I told them they had bad patch cords, they would fall out of their plugs if you so much as breathed at them.
Following my advice, they replaced them with proper cables. So far so good.
But one of the circuits didn't come back up fully. They properly diagnosed the new cable was dirty and needed cleaning.
Mistake number 1- I told them the cleaning supplies they had on hand were terrible, and likely to make things worse. Standard cleaning solution is isopropyl alcohol. I was told that was a hazard and they needed to use an alternative cleaner. This alternative cleaner is a mixture of Propyl Acetate, an industrial solvent that is itself flammable and emits hazardous vapours that are also flammable. Ethanol, which is literally fuel. And- Isopropyl alcohol!
This cleaning agent leaves behind some oily residue that- causes ORL issues because it's shiny. So they cleaned this fiber over and over and failed every time, and concluded the problem must be elsewhere- completely ignoring that I had already told them their cleaning supplies sucked and were incredibly inadequate.
Now if their cleaning supplies weren't awful, that might be a reasonable conclusion. So they performed a loopback test.
The connection normally goes equipment, patch cord, rack, cable leaving building, rack in next building, patch cord, equipment. A loopback test is purposefully looping the patch cord back to the transmitting equipment by connecting it to itself at the rack.
When you do this, you add an attenuator. This is a special plug that adds loss. The system is set up assuming you'll lose so much light going from building to building, so that short range connection needs to be 'dimmed' a bit. That's what the attenuator is for.
Mistake number 2: they connected the attenuator, and forgot where they put it.
The 10dB degrade I was sent to repair was caused by a 10dB attenuator they installed, and couldn't remove, because they forgot where they put it. And on top of that- they still needed me to clean their damn panel for them.
The remote support team knew all of this, and through tactical lies of omission, made me think I might actually need to fix something that was my responsibility. Instead they used my competence to fix shit their internal team doesn't have the training or equipment to deal with: which are basic essential functions of any datacenter team.
These lazy bastards tricked me into troubleshooting diagnosing and repairing their own shit because they can't be bothered to train and equip their own employees to do their jobs.
I put my foot down and told them this is our last courtesy dispatch. For future calls involving this datacenter, we are working strictly to the terms of our contract.
Dav3le3@reddit
Cool story OP! Thanks for breaking down the jargon.
Reads like what an ELI5 should be!
armwulf@reddit (OP)
Part of my job is teaching the jargon. I have a large repertoire of simplified explanations.
bobbane@reddit
Dumb question- you mention a 10dB loss and explain it as a 90% degradation. Isn’t that a 50% degradation? In sound at least, a bel is a factor of two.
armwulf@reddit (OP)
In fiber optics, decibel milliwatts (dBm) are a logarithmic unit that converts to milliwatts as a measure of power. +/- 10 is multiplying or dividing by 10. (Exact) +/- 3 is multiplying or dividing by 2 (Close approximation)
If a transmitter is putting out +10dBm, and the receiver is catching -16dBm, that means the fiber between the two had 26dB of loss- which is dividing the input by 10 x 10 x 2 x 2- 1/400th the input reaches the destination.
Typical loss is about a quarter dB per kilometer + 1. So a 20km span would have about 6dB of loss.
ksam3@reddit
Alright, I'm gonna need you to ELI2, lol. Or 1 even.
armwulf@reddit (OP)
Sure. So we're shooting lasers through the fiber optic cable. That laser has a "brightness", that's the amplitude or power.
Power is typically measured in milliwatts for stuff like this. But it's also the same as say, horsepower. Literally. Power is energy divided by time. That's why torque and horsepower are different. Torque is how much force an engine can produce, horsepower is force per second. Horsepower and watts are like inches vs centimeters, they measure the same thing, just different units.
When that laser passes through something, like a connector, it gets dimmer. How much dimmer it gets is "loss" or attenuation. But that loss is a percentage, not a linear number. If I put in 10 watts and lose 1, that's 10% loss and I'm left with 9. But if I only put in 1 watt, I would only lose 0.1, and be left with 0.9.
Additionally, the fiber optic cable itself applies loss. How much varies. Typically about 5% per kilometer of length.
Now here's the challenge. You have a fiber optic cable that's 10km long. You remove 5% of the light per kilometer, but it's not 100%, 95%, 90%, so on. It's 5% of whatever remains. So it's like calculating compound interest in reverse. Now add splices, which have about another 3% loss each, and at least one connector at either end with another 11%. The math becomes really really complicated to figure out the final percentage of light that reaches the end.
So like I said before, we put the complicated math in the unit conversion.
That 5% loss per kilometer? That's 0.22dB now.
3% per splice? 0.15dB
11% per connector? 0.5dB.
And all you gotta do is add the numbers together. Really easily. 10km of cable? 0.22dB x 10 = 2.2dB 6 splices? 0.15dB x 6 = 0.9dB 2 connectors? 0.5dB x 2 = 1 2.2 + 0.9 + 1 = 4.1dB.
We love decibels because they turn a math problem that would need a degree and a graphing calculator into- basic algebra you can do in your head.
To convert it to a percentage, you can use factorization. Just remember for every 3dB you go up, you multiply by 2. For 10, 10. Going down is the same but dividing.
4.1, we can round that to 4dB. Going up by 10 is x10 Going down by 6 is going down by 3 twice, so we take 10, and divide by 2 twice.
10/2 = 5, 5/2 = 2.5. So a 10km span with 2 connectors and 6 splices has an estimated loss of about 4.1dB which is the same as saying 1/2.5th, or 2/5ths of the laser will arrive at the destination.
udsd007@reddit
Just like radio link loss computation. Exactly.
Breitsol_Victor@reddit
I wish someone had splained like this when I was doing M/W radio for uncle. Didn’t really need to know.
That_Ol_Cat@reddit
New Math.
crh23@reddit
In addition to what the OP wrote about dB in fibre, in audio a bel is also a factor of 10 - but it's a factor of 10 in sound pressure level. Human auditory processing is very complex, and adds another layer on top that means the perceived loudness doesn't at all move 1:1 with SPL
stelei@reddit
I could tell this wasn't your first rodeo when you said "the connection was too shiny".
I have no reason to ever use that phrase IRL but I am stealing it nonetheless.
ergo-ogre@reddit
Can’t help but think of Firefly
Cheap_Flamingo476@reddit
Throw in some Chinese cursing and it would be Firefly.
That_Ol_Cat@reddit
Pretty sure Op has edged into Chinese cursing dealing with these people.
WarlockyGoodness@reddit
As someone that’s spent 20 years in telecom, I empathize.
Also as someone that teaches many of these concepts, this is hilarious.
Keeping fingers crossed for you.
ChooseExactUsername@reddit
Great story. I used to be a "router jockey" and that brought back some memories. I don't think I've ever "lost" an attenuator or forgot about which fiber I was doing a physical loopback on.
(If you're in the UK it's a "rooter jockey")
Throwaway_Old_Guy@reddit
It should be billed heavily this time to ensure there is no second occurrence.
armwulf@reddit (OP)
Sadly it's all a pre-arranged contract with pricing sheet. But we tend to go "above and beyond" to keep them happy. This site has been exempted from that treatment. OSP only. No unnecessary site visits.
Gadgetman_1@reddit
And that pricing sheet doesn't have an item for wrongful callouts?
TheLazySamurai4@reddit
"That sounds too inconvenient for our customers, we might lose contracts to competition." - upper management probably
That_Ol_Cat@reddit
Nah, that's marketing. They'll sell any service for cheap if it means they get the bonus on the big deal.
fatmanwithabeard@reddit
One place I worked was willing to allow sale to forgo the wrongful callout charge listing, if the salesdroid was willing to take their commission over the life of the contract, with any wrongful call time deducted from that.
Man, I miss that place sometimes (it was hell)
armwulf@reddit (OP)
Spot on.
Gadgetman_1@reddit
Check your autocorrupt. Thatshould probably have been 'Upper manglement'...
Skulder@reddit
.... If I lied my way into getting a job there, how long would it take before they figured out I don't know my elbow from my ass?
armwulf@reddit (OP)
They recently fired one of their engineers, because they tested positive for meth.
Despite my protests, we are still using the meth addicts designed routes for major fiber projects.
The designed routes are not good.
KelemvorSparkyfox@reddit
I assume that meth is cheaper. So they can get higher for longer, and still have hooker money at the end of the day?
HelloAll-GoodbyeAll@reddit
Is there a noticeable difference in quality between work done by meth addicts and that done by cocaine addicts?
StorminNorman@reddit
Probably. Meth keeps you going farrrrr longer, and it fries your brain far quicker than cocaine, with the added bonus of making you dumber than cocaine would. Just a review article, but it does illustrate how neurotoxic meth is in comparison.
That_Ol_Cat@reddit
Also a true fact: Any meth production site automatically qualifies as a toxic waste spill.
Dramatic_Mixture_877@reddit
Thanks for the link! I have multiple sclerosis and reading about what drugs (especially meth) do to the brain made me think of the old skillet and egg commercial that said, "This is your brain. This is your brain on drugs.". They were literally correct - they need to start airing that on Saturday mornings and on kid-centric channels, like Nickelodeon.
Neuroinflammation is a big issue with MS - any inflammation is, but in the brain is big trouble. I always ask my neurologist to order a CRP test with the rest of my blood panels they do as an additional check on inflammation, not just relying on white cell counts. I'm in bad enough shape as it is - I wouldn't dream of throwing street drugs at my brain!
That_Ol_Cat@reddit
Maybe he has a home chemistry set and loves "Breaking Bad"?
anonymouslosername@reddit
Because you get so much more meth for your money? 🤷♂️
Arietam@reddit
Cool. So “designed by meth-addled monkeys” is actually a fair descriptor.
ergo-ogre@reddit
“Designed using M-AM protocols”
MasterClown@reddit
I really like this idea! I'm going to give it try :)
Oh... never mind
ZeniChan@reddit
I always like it when I go to a new data centre/server room/wiring closet and immediately see a length of yellow fibre cable coming from the core router and plugging in to orange fibre cable via a coupler. When I bring it up, they say just don't touch it. They ran out of fibre a year or two ago and never got around to fixing it since it's working, as long as you don't touch it.
That_Ol_Cat@reddit
1st rule of Engineering: If it ain't broke, don't fix it.
2nd rule of Engineering: If it's stupid but it works, it ain't stupid.
Neither of these rules, of course, is approved by OSHA!
armwulf@reddit (OP)
Frankly I'm at peace with that. But when you tell me you have extremely stringent performance standards and absolutely prioritize uptime, but your ISP technicians couldn't pass the CompTIA A+ and don't have proper cleaning supplies, the solution is not to use an external vendor to replace your internal team
dreaminginteal@reddit
Gasp! That would require spending money! And the internal team might get "uppity" and leave for somewhere that doesn't pay as shit wages.
Stryker_One@reddit
They forgot where they put the attenuator.... What else might they have forgotten?
Jwatts1113@reddit
Bet they remember where the meth is
spaceraverdk@reddit
As someone who has zero experience with fiber besides the one coming from the ISP to my house, I realise that having a fiber run in my house is iffy and bothersome compared to using copper.
Even though I have a couple of brand new fiber patch cables and some SFP transcievers I thrifted.
DaHick@reddit
I am only 1 up on you. The backbone in my house between my firewall and my basement switch is fiber. I don't know anything but plug it in, and it works. Today I learned I should occasionally clean the ends with Isopropyl Alcohol, but I don't know what percentage or what type of cloth. Off to go read up on it.
ohioleprechaun@reddit
I'm not OP, and definitely not a fibre networking guy, but you shouldn't have to clean them if they're connected. If you're constantly unplugging/replugging or leaving them uncovered when unconnected, you may need to clean them.
DaHick@reddit
It's my basement, I butcher wood down there and do some technical stuff. I am an automation and controls engineer, and my home lab is down there, thus the need for the fiber backbone. SFP is not a sealed connection, so I may need to clean it.
nagi603@reddit
Use DAC cables then. Those are sealed units with adapters already on both ends. Some are copper-based, others optical fiber.
ohioleprechaun@reddit
I retract my comment then, you probably need some sort of regular cleaning schedule. And maybe a sealed box to put it in, but you will know more about that than I do.
DaHick@reddit
Yeah, I'm considering an IP 60 or 61 box, but then I have to do heat rejection. Was not thinking of using those skills at home.
https://rclite.com/wp-content/uploads/2021/07/IP-Rating-Chart-scaled.jpeg
armwulf@reddit (OP)
We typically use 99% but you really have no reason to touch it. The only reason you would clean it is if something goes wrong or if some idiot touches the end with their oily greasy hands.
Honestly if you have a medical kit with 70% alcohol swabs used for cleaning injuries those would do just fine for your purposes.
DaHick@reddit
I still have 90% around for when I was cleaning tape heads, so thank you.
armwulf@reddit (OP)
Honestly I would still use the swabs. The biggest thing is using a cloth that doesn't leave behind even microscopic debris. Those medical swabs are actually really good for that reason. You can't just use a normal cleanex or cloth. We typically use laboratory Kimtech wipes but that's overkill for you.
DaHick@reddit
Nope, got some. They have other uses 😊
armwulf@reddit (OP)
Fair enough. Polish in circular motions, use wetted wipe first, then dry. Don't touch the same part of the wipe to the connector twice. Rotate your thumb to the side as you wipe to ensure fresh wipe is used with each motion.
DaHick@reddit
Thank you again. Sounds almost exactly like tape heads, but we could get away with just cotton swabs.
brawnyfrogmouth@reddit
no good deed goes unpunished
Astarothian@reddit
...is this in Texas by chance
armwulf@reddit (OP)
It is not. East coast.
Astarothian@reddit
Ah. I just hear stories from one of my buddies who works at a datacenter there on how they are mostly using trainees to set everything up, and its basically gas station workers turned technicians.
WorkWoonatic@reddit
Sometimes I wonder if it would be difficult to get a datacenter job, then I read posts like this and wonder if I'm overqualified.
chg1730@reddit
I sometimes get called into datacenters for stuff and if I miss my shoe covers I get reprimanded for possible contamination on fibres. These guys were just doing whatever and still work there, damn.
Should technically be able to measure in between each connection point and see the attenuation no? Still a PITA and not your job but would be easier than tracing the entire cable.
CupidStunt28@reddit
Ethanol, which is literally fuel. And which burns with a colourless flame, so they won't even notice they're on fire until things start getting extra toasty.
Candid_Ad5642@reddit
Yeah
Your contract needs a phrase with something about if they call in an incident they claim it's within your responsibility but you can demonstrate the fault within their bailwick abcd they should have known, you invoice time and materials at "punitive rates"
apnorton@reddit
Wat.
Realistic-Muffin-165@reddit
Yeah the meth addict probably drank it from the sounds of things.