Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
Weirdly broken Wi-Fi access points (kmjn.org)
178 points by mjn on Sept 13, 2016 | hide | past | favorite | 173 comments


For those who are curious, there's a bunch of reasons that contribute to why consumer wifi routers sucks ass:

1. Wifi routers are very complicated. You need smart people at all levels of the stack to build a modern wifi router. You need people who know 802.1ac like the back side of their hand, people who know how to set up and deploy linux environments, embedded engineers for debugging OEM driver issues, networking gurus for handling the voodoo in levels 2-4 of the OSI stack, application people for rolling the user interface, cloud people for cloud support. Normally this isn't a big deal, if it wasnt for the next point

2. The profit margins on consumer routers are complete trash. Even if you're one of the big boys with double digit market share, you're going to have a very hard time keeping a decent engineering team staffed and your marketing team staffed at the same time while still breaking even.

3. Consumer router sales are SKU driven. There are dozens of price and performance points you have to hit to meet the demands of the consumer market. You cannot make be profitable with less than 10 actively selling SKUs. Every time you release a new SKU, it's a new opportunity for marketing to try to sell the device to brick and mortar stores that they're trying to expand into. If you aren't releasing 5-6 SKUs each year, you're going to have a very hard time keeping your router on store shelves.

4. Since sales are SKU driven and your engineering team is probably under funded, you have the exciting problem of maintenance releases. If your company has 50 supported SKUs and you find a non-driver issue in one of them, the chances are that it affects 10 other SKUs as well, if not all of your other SKUs. Pushing that maintenance firmware to 50 SKUs could easily take 6 months of combined QA or Firmware development time. As far as your marketing department is concerned, all that time you spend on maintenance releases is time that isn't spent on making new SKUs with exciting new features.

tldr: get a business class wifi router


My experience is that most consumer routers are actually developed and maintained by teams at companies you've never heard of overseas. This is partly as a response to the engineering challenges you've outlined.

Taiwan is a popular choice, due to the combination of proximity to manufacturing in China and an English friendly business culture.

Every so often the big brands go shopping to build a new product and they select something that meets their needs from the white-label products available.

The product then gets customised for them. What the big brands contribute is industrial design of the case, some qa, packaging, ui styling, a distribution channel and consumer trust in their name.

Which leads to funny scenes like watching engineers at a white label do board relayouts and emi retesting because the industrial designer wanted the antennas to sprout out of a different bit of the case. Because it looks more badass that way...

This also leads to the phenomenon where say, v4.15 of a router was cool, but 4.16 sucks - because internally its a totally different chipset, with a different os (e.g. vxworks vs Linux). Ref e.g. WRT54...

The reason this happens is not because the brand name rewrote anything - they just selected a different white label partner who customised a totally different product to look the same (probably because they promised to deliver cheaper for same SKU). This then gets sold under the old, popular model name.


This is why I was happy with my Apple products for years. No matter what kind of third-party thing I'd buy it would end up being a piece of trash after a little while. Apple Airports tended to just keep working for my family.

Too bad the new ones are ugly as hell. Oh well, my older model still works.

And these days so many people get it through their cable provider (or DSL, whatever) who have a marked interest in not generating support calls means you may get decent equipment.


> And these days so many people get it through their cable provider (or DSL, whatever) who have a marked interest in not generating support calls means you may get decent equipment.

You would think that, unfortunately it isn't true. Usually those ISP provided combination routers are completely outsourced - as in their hardware and firmware are both made by a chinese OEM overseas.

Believe me when I say there is nothing worse than the firmware on those chinese routers. They'll grab a bunch of open source software, reskin it, plug the chipset's reference driver into the OSS, then do a basic sanity test and ship it. You get what you pay for, and the ISPs want to pay as little as humanly possible.

I don't know what they do about the support. I remember one time we brought in an ISP's combination modem/router for performance tests against our device lineup, and it would crash if you ever tried to pull more than 40mbit/s through its ethernet ports.


Having worked at an ISP, the problem is that pretty much all the routers/modems at price points consumers will accept (because an expensive router, even if leased, means a higher monthly bill) suck.

They all work really well in the testing lab, where there's 50m of cat5 between the DSL modem and the DSLAM on the floor below. But then you find out once it's deployed that they have big issues with certain kinds of interference, so for a small segment of your userbase they have more dropouts than other routers. Since the company has already bought tens of thousands of them at this time, it's not really practical to just recall all of them and buy new ones. So the ISP files a bug with the upstream vendor. After some months of pushing, the vendor gives the ISP a special firmware build that's supposed to fix the problem. The fix never gets into the official firmware line (presumably because it has other downsides that the mfg doesn't want to talk about, maybe it's more conservative in line measurements etc), so now the ISP is locked into running this custom firmware version. Oh and that one probably has its own bugs that the older one didn't.

I saw this happen so many times, with routers from multiple vendors over my years doing this kind of thing. I don't run consumer networking equipment in my home anymore.


Fully agree with the comment about the quality of Chinese firmware. I spent the best part of a decade working for a UK company producing reference designs for mobile chipsets, which we sold to many Asian clients, and I lost count of the times I had to travel to the Far East to sort out the mess they'd made of the software.

Their philosophy seemed to be to throw as many inexperienced graduates at the problem as they could afford and eventually it would somehow work - as though an infinite number of engineers would eventually produce the works of Shakespeare!


I've never understood Shakespeare to have much renown as a software engineer, though I suppose I've never understood him to be a particularly poor one either.



I'm aware of the theorem, but I was being willfully ignorant to achieve a primary goal of making a joke. The secondary goal was to point out that the engineers the Chinese are hiring are probably better at programming than Shakespeare was. Regardless, the tone of my comment was in line with that of the comment I was responding to -- surely anybody hiring engineers would not judge the venture a success if they found the engineers had reproduced a number of plays from a particular Victorian playwright.


Elizabethan not Victorian.


Whoops, good call.


I switched cable internet providers because the original one (a large incumbent here in Toronto that rhymes with Ogers) wanted to force me onto their combo wifi/modem (the other large incumbent, rhyming with 'Ell, has the same constraint). I absolutely refuse to use an integrated device from the carrier at this point because I have experienced the crap that they use previously. Give me a piece of equipment that does one thing well, and let me pick the most reliable in-house networking equipment myself.


Standard procedure here is to ignore the device's onboard wifi and plug in your own access point. Don't tell them, or they may whine about it being unsupported.

You don't have to use the integrated access point just because it's there.


Sure, but you still wind up dealing with the craptastic device. The router I currently use which is forced upon me by my ISP will hang for a few seconds if it encounters a DLNA packet. That means several seconds of packet loss and some TCP connection being closed.

I have wasted so many hours on the phone with their support without getting anywhere. They wouldnt even acknowledge there is a problem until I showed them graphs. Then they sent an "engineer" over who also knew fuck all and accomplished nothing. He then seemed to have reported back there was nothing wrong and now they ignore me.

I should switch ISPs but the performance of the network compared to the price is excellent. Just this shitbox of a router.


How do they force the router on you? Is there really no possibility of using an alternative?


At least in Germany we had the so called "Routerzwang". Some ISPs allowed third party equipment, but many didn't. You'd get a modem / router / AP combo device and the ISP had complete control over the firmware. Since there was no competition, most ISPs didn't care about the quality of their software or even security updates.


Thankfully in the UK all the (cheap) ADSL suppliers I've used have provided a modem+router+AP in which you can just read the access credentials off the config screen. No need to spoof MAC or anything when setting up a new 'router' just use the right credentials.

Now I'm wondering why, whether it's legislation or the way our ISPs relate to the infrastructure.


Primarily because our phoneline also comes from the same ISP which has to use their shitbox. They don't give you the credentials for that, you get a pretty configured router. It's possible to root the box, steal all necessary credentials and clone the mac but that is a huge hassle and obviously not supported.


I had similar issues, and I complained enough that my ISP gave me a second, just plain stupid modem and a splitter, so now I have their fancy phone/modem/router/wifi one just for the phone, and a simple modem only box for my actual internet. Works like a charm now!


Interesting. I am moving soon so I have given up the fight for now, but I might have to aim for that after the move. Thanks for the tip.


Just keep in mind, their "first tier" of customer support probably can't authorize that kind of thing, and probably won't even know what the heck you are wanting. I complained about the crappy slow internet and bad ping until I got to their actual tech guys, and told him the older modem did better, and he was like "Well, why don't you use one of them?", I said "Need the phone", and he told me to just put a splitter on it and use both, he'd set it all up and have the local office put one aside for me. I only even pay rent on the one modem, so it's actually an amazingly good setup, other then meaning I've got the start of my own "commercial grade" networking setup attached to a wall in the basement, with the modems, the router, a switch, and a server.


In the netherlands providers has linked the wifi enable settings in their modem/router/accesspoints to their online account and if you disable the wifi you won't get access to the free networks they deployed everywhere.

This is mainly because those networks are deployed by adding an second SSID to the router with WPA enterprise auth. The solution many people choose is putting the router in the basement with a lot of aluminium foil around it.


You can also put a heavy resistor inline of the antenna.


If you can do that, you could just unplug the antenna.


But they want the router back if you switch provider. If you void the warrenty then I have to pay for it.


I don't see why this is a problem. You want to free load on other people's connections but not share your own? (In the US anyway, the free wifi you get for running that secondary SSID just comes from other people's personal connections).

This is why we can't have nice things. I hope they find a way to defeat people doing this, but my guess is it would be prohibitively expensive (war driving entire cities, etc).


The problem is that I don't have the bandwidth for it. If a single user connects to the free accesspoint and starts skype r a youtube video then I cannot use the internet anymore.


I had this problem with my Optimum router. I actually got better performance by throwing it in the closet and using a $30 dollar piece of crap instead (which has since been replaced). Sadly, it seems that everything in the consumer networking market is a trap.


Wrap the router in foil?


That's unfortunate. I never really use the Wi-Fi on my Comcast box so I don't know if it was any good. I've had great experiences with people Google fiber boxes, but that's Google and isn't exactly a mass deployment so…


> You get what you pay for, and the ISPs want to pay as little as humanly possible.

More a case of you get what they (the ISP) pays for (or doesn't pay for).


Yes I believe such cheap routers are just copies of the reference designs that the leading manufacturers publish.


Too bad the new ones are ugly as hell

So you're going to have to start putting your networking gear away in closets, behind doors, like the rest of us ;)


But it's an apple product which means it's a fashion accessory.


For home usage, high end consumer routers with free firmware work just fine (something like Linksys WRT1900ACS with DD-WRT / OpenWRT).


Admittedly I haven't tried the newest models, but my experience with Linksys hardware and either ddwrt or tomato was pretty bad. Run for a couple of days and then require a power cycle. Seemed like a hardware issue.


I'm running tomato on a WNR3500L/U/v2... uptime is 134 days and I'm pretty sure the last bounce was a power outage. So, ymmv?


WRT1900ACS is pretty good with DD-WRT (uptime can be until next firmware update). Linksys also sell special set of antennas, which improve signal range (costly but can be worth it depending on your situation). I think they target it for network enthusiasts, rather than very wide market, so they put some effort into quality there.


I've had the same experience with DD-WRT on TPLINK hardware. DD-WRT has great features but always seems to require a reboot every 1-2 days :(


Counterpoint: I run dd-wrt on a TPLINK el cheapo something at the office, to connect all the boxen and let me SSH into each of them (port forwarding 19 to 22 on box1, 20 to 22 on box2 etc). It Just Works, never had to reboot it in four years now. (WiFi is off though.)


Likewise. I ended up reverting a couple of my Linksys APs and WRTs back to stock firmware and living with the lack of features...


dd-wrt has gotten pretty bad. They don't publish new releases anymore you just pick a daily build from one of the random people who publish builds and hope it doesn't brick your router. Learned this after bricking my router :(


I think their main developer (Brainslayer) publishes his builds periodically. So those aren't random people. But in general it's not really a fully open development project. I.e. it's not governed by any community.

OpenWRT is better in this sense, but even there some people split into recent LEDE project: https://www.lede-project.org

Regarding bricking. Linksys WRT1900ACS is designed with that in mind. It has two partitions, and you always flash updates to another one, so if something goes wrong, you can easily switch to the other partition which will hold previous installation, using special on / off sequence. It's a neat idea, and more routers should follow it.


I have the 1200AC it's got a 1.3 mhz proccessor. Not sure if it's as good as the 1900


WRT1900ACS has 1.6 GHz dual core CPU (Marvell Armada chipset): https://wikidevi.com/wiki/Linksys_WRT1900ACS


Are there any specific business class routers you would recommend?


I like the Ubiquiti gear. The AC lite is affordably priced. I'm also happy with the ERLite router.


It should be noted that Ubiquiti APs have no router functionality, they're just plain access points. Which is the way I like it, but might not be appropriate for many use cases.


I bought myself an Ubiquiti Security Gateway and the AC Pro access point. Works really well and the total cost wasn't all that much more than, for example, an Airport Extreme…


The best part is the fq_codel support :-)


Unfortunately it really limits performance, to about 60 Mbps on an ERLite.


Yeah, it disables all the hardware offloading unfortunately. But if you really need it because of a slow internet connection, I guess 60Mbps is good enough.


Their external antennas are awesome as well. For home use I prefer the Mikrotik Gigbit routers.


i actually edited my post to include the thing about openwrt/ddwrt.

If you need stability, go with a high end consumer router and load OpenWRT/DDWRT on it. OpenWRT and friends are very stable, but don't often take advantage of advanced features like automatic channel selection, traffic prioritization, or beamforming.

If you absolutely need features, go with a ubiquiti AP. My only issue with them is their hardware leaves a lot to be desired, and is rarely powerful enough to cover a full home, this isn't necessarily ubiquiti's fault though.

Business class APs expect you to set up multiple APs in the office, so their hardware is typically a lot less powerful than what you'd see in a consumer wireless router. A ubiquiti AP will gladly cover your 1 bedroom apartment, but don't expect it to cover your 3 bedroom home


I have a pair of AC lites covering 4200 sq feet across 3 floors. One is in the upstairs hallway and the other in the basement ceiling.


I have a single unifi Ac pro covering 3700sf and four bedrooms extremely well. (Two stories, not 3700sf on one level).


The difference between your and GPs experience probably comes down to concrete vs. wood housing. WiFi, like most radio signals, has really poor propagation through concrete.


Hah. Same here down to the square footage, except I have the n version.


I would love to use {Open,DD-}WRT, but it seems there's generally not great support for DSL modems :(


I ended up just keeping my ISP router as is, and putting a DD-WRT router behind it. I suspect the double layer of NAT isn't great, but I've been doing it for years without issue.

DD-WRT has been pretty solid for that, even in non-ideal network setups (currently have it connected to the ISP router as a 2.4GHz client, then acting as a 5GHZ AP for my other devices).


Are you not using the isp provided router as a modem only? Why not do pppoe (or whatever) on your dd-wrt?


Not all offer PPPoE, for some unfathomable reason.


Use a separate modem and router. If you can disable NAT on your current combo unit then you already have the modem.


Personally I really like the new Draytek models (like the 2925 and 2860 series). Lots of nice features like central access point management (Draytek APs only) and central VPN management to configure remote Draytek routers are great features on top of fail-over/load balanced multi- WAN and even a model with built-in LTE modem as a WAN link. They're intended for always-on/highly reliable situations. You can even pair 2 units into an HA cluster.

They're very popular in Europe but oddly never seem to have had the same impact in North America, although you can find them if you look around a bit.

They did do a linux based variant at one point but switched back to their custom rolled OS a couple of years back. Not sure of the story there or if it's possible to do a custom firmware. There was something on google code at one point.


pfSense running on an old laptop or other lightweight system, along with a VLAN-capable switch and a Ubiquiti or Cisco WAP371 wifi point. Bridge pfSense to your modem, and you have a full stack. You can run your pfSense box on a stick via VLAN-ing, so you only need one interface.


Just be aware that if you're using any flavor of Comcast in the US (Business or Consumer-class), "Bridging" your cable modem will make it cease to work and require a factory reset. Anecdata from me, from sites in Boston, Seattle, and Portland.


Personally I prefer at least desktop hardware for my pfSense boxes, with a physically separate WAN and LAN port. Maybe that's just because I'm bad at VLANs.


The benefit of a laptop is that you have a built in UPS. A laptop is also going to consume less power than most desktops.

As an aside, if you're after a nice, fast, low power solution, you can also look into the Netgate pfsense hardware, eg, http://store.netgate.com/ADI/RCC-VE-2440.aspx.


Pfsense routers from the pfsense store


Aruba or Rukus avoid dlink like the plague


Get a Meraki for free https://meraki.cisco.com/freeap


I find most consumer-grade Wifi routers make more than adequate ACCESS POINTS, but they suck at being an actual ROUTER. My TM-AC1900 (rebadged RT-68U with a T-Mobile logo on it) would need to be restarted daily, after I finished setting up my homelab two months ago I put a Ubiquiti EdgeRouter X in front of everything and just connected the old router to my switch and put it in AP mode, I haven't had my network die out on me and require reboots in months (with the exception of my cable modem going out).


Or get a sane router:

https://omnia.turris.cz/


The problem with Omnia is that it's a niche product that is very unlikely to ever sell outside of its niche.

Also, it's being made by people who haven't made a router before. 99% of the issues you'll ever have with your wifi router are driver issues. Omnia is using OpenWRT for their base, however they'll still have to make the chipset driver work with OpenWRT, and at a reasonable speed.

Even if they get the driver working, it's unlikely that the device will be running at speeds comparable to other market competitors. It takes lots of unpleasant driver hacks, custom networking stacks, and prioritization magic to hit the benchmark numbers that Asus, Netgear, and Linksys push out.

I'm not saying you should write off the Omnia completely, I just recommend sitting back and waiting for it to be released/benchmarked before purchasing it.


What about AVM products such as a Fritzbox (https://en.wikipedia.org/wiki/Fritz!Box) ? Primarily a german product but also available internationally (e.g. Australia)


The Fritz!box is the only router I recommend to friends. I've run into few problems with them, except persistent issues with the web interface responsiveness. The only real problem is that they are relatively expensive. It's hard to convince people to spend 4x the amount because it's "better".

I think most non-geeks don't realise just how bad their routers are making their web experience.


In my experience, the wifi performance of Fritz! boxen sucks. Is this still the case?

I ordered a Netgear Nighthawk R7000 and I'm very happy with it.


Well, that depends on your perspective I guess. For most Aussies stuck on ADSL (real world max speed of 20mbps download, on a good day, close to the telephone exchange), their WiFi performance is rarely the rate-limiting factor.

I haven't seen a head-to-head comparison.


Why bet on a risky croud funding campaign when you could just go with a Ubiquity.


I bought a Ubiquity a few months ago, and found out that it couldn't communicate over wireless with a Raspberry Pi's USB-WiFi dongle (with drivers in mainline kernel) sitting 3 meters away. Before I bought it, the same dongle was acting just fine as access point for the whole home network of 3-4 phones/tablets and 1-2 computers; it could easily handle browsing, updating Android/iOS apps, and YouTube video streaming. And it still is doing that just fine after I sadly had to return the Ubiquity!

Now of course this is just my own anecdote. But it remains a fact that WiFi setup can be incredibly hit-or-miss, and there's no reason in principle why two devices 3 meters away should have occasional 2-3 seconds packet delays.


Sounds like you just configured your Rpi wrong.


I didn't need to configure it at all, it just picked up the SSID.


cz.nic isn't risky in the slightest. Ubiquiti don't ship with open source firmware.


It has nothing to do with the firmware it has to do with the fact that they haven't delivered a single working router to a customer yet. You don't seriously believe that buying something through a crowd funding campaign carries zero risk do you?


In this case yes, funding cz.nic has zero risk. They just sent an email saying the first shipments will start this week.


As a product, thats what I would have bought had it been available. Why have a weak router and a weak NAS if I can have a decent combined device with the same power consumption?


I think the problem is not with the actual IP routing part, as that is relatively straightforward, but with all the other value-added "features" that get crammed into the same and often underpowered hardware. At the least, a consumer WiFi router will contain an IP router, a NAT, and an AP, and it's the NAT that probably contributes a lot to the connectivity failures people are seeing that go away when the "router" is rebooted --- NAT tables filling up due to many half-open or lingering connections that didn't get closed properly. The aggressive nature of many applications' connection usage (e.g. "why does my router stop responding when I use torrents?") doesn't help either.


It's not just terrible NAT, although that's popular. I already do NAT on a PC server, so I run wifi routers in access point mode, and still need to do periodic reboots because the wireless interface stops working.


I've seen wifi AP+NAT+Router units that choke on 100 concurrent connections. Some of these are absolute trash.


Is there any specific reason why consumer grade router companies (like D-Link) drop their own OS entirely and shift to a open-source alternative? That should bring down development costs somewhat I'm guessing and force them to make different boards compatible.

Not a hardware guy, but I've looked at D-Link firmware code in the past, and it is just utterly shoddy. Just making the hardware and making sure it is compatible with some router-os should be simpler. (Essentially what happens in the mobile/laptop industry with android/windows).


Maybe not exactly what you want, but Netgear has a fairly wide range of routers with open source firmware support and have a dedicated site [1] for that. I've had good luck with the previous 2 Netgear routers (WNR3500 and WNDR3700) and will probably buy another one again in the future.

[1] https://www.myopenrouter.com/


It's pretty obvious that 2 and 3 are a big problem. I'd pay significantly more for a router that I can guarantee works reliably (cost doesn't seem to make a difference for consumer router reliability); find a way to advertise that and I'll love you forever. Make packet loss and latency on a variety of popular hardware / OS (with default settings) THE key metric.


Protip: if you are willing to pay more for reliability, buy Ubiquiti (or other enterprise class) hardware. It's actually not that much more expensive, and you get great features like PoE, design/color ment to be invisible, and great handover between multiple APs if you have a large house.


That's precisely what we're doing at eero. We have engineers focused on each part of the stack (wireless firmware + drivers, core OS, application layer, mobile, cloud, and data) and are pouring our efforts into a single sku. It's time that people had the network they deserve...especially since they run our homes.


> Consumer router sales are SKU driven.

Why is this? It sounds like having recognition of your model numbers is a bad thing, if you need to keep revving them so much.


People rarely recommend routers by their model number, if they recommend a router, they recommend the brand.

It's very hard for marketing guys to call Costco and say "hey, you remember that router we tried to sell you 6 months ago? well we just gave it a firmware update!", and close the sale

it's very easy for marketing to call Costco and say "hey we got this fantastic new router you guys might be interested in stocking", and close the sale


There are exceptions - the BT HomeHub and now SmartHub have always been rock solid for me.


My BT HomeHub5 has been pretty unreliable - regular dropouts of Wifi from some Macs, similar to the DHCP thing that OP described. Then monthly reboots when everything stops. Performance is pitiful, but I blame that on the ADSL1 :)


Do you have any recommendations?


What business class router do you recommend?


Draytek Vigor 2860 range:

http://www.draytek.co.uk/products/business/vigor-2860

I finally gave up on consumer routers, and bought one of these instead - the learning curve is steep, but it's overloaded with features and runs brilliantly.


My experience using draytek is that the documentation is non existent and the small amount that does exists makes the exotic bits of cisco documentation as clear as day.


Well, yes. It's not easy, I did say that, but once you get past the knowledge hurdle, they are very good bits of kit.


I think there's also an issue of easy diagnostic. It's extremely hard for non-tech people to assess the quality of the network; they basically have to rely on "time to open google" which is obviously a complicated metric to rely upon, and can be measured only with a manual test.

Wifi has a single quality indicator, the "signal power" which is useful but doesn't say it all (it doesn't even take SNR into account). Somebody should come up with a monitoring algorithm, possibly mixing radio stats and stability of pings, that converts into a simple indicator on the UX of all operating systems. Something like green/yellow/red. Once people know their connection is "always red with MacBooks", they will fix it; hotel managers can ask for support from suppliers, airbnbs can go to the mall where they bought the router and complain.

This would be up to the Wifi Alliance to fix, but they are the worst slow-moving committee ever seen on this planet, so I'm not holding my breath. They're probably implementing a new useless encryption algorithm that will be broken in 5 minutes and stay broken for another 5 years till they agree on something else.


This begs a question: could Wifi Alliance be replaced on this one? Assuming open source software (Linux PC + OpenWRT router), could this be implemented manually?

I once contacted Wifi Alliance regarding this one: https://github.com/d33tah/call-for-wpa3

This was their response:

  > Thank you for contacting Wi-Fi Alliance.
  > 
  > In regards to your question about work on security developments, there is a member task group that is working to advance Wi-Fi security enhancements. You can find all the current work groups on the Wi-Fi Alliance website at http://www.wi-fi.org/who-we-are/current-work-areas.
  > 
  > In order to participate in discussion and work groups, companies must become a member of Wi-Fi Alliance.


If you do this, please make it a continuous scale rather than green/yellow/red. There's a big difference between "the worst that connection can get you a green" and "a really good connection".


I suggest we express it in dBs. :^)



I wonder how many of those could be fixed by simply rebooting the router.

I've had numerous problems with my own Time Warner Cable modem/router. It'd always become pretty bad after about 3 days of operation, dropping my speed to about 2% of what it was initially. I spend months debugging the damn thing, using a custom router, swapping cables, etc, and nothing would fix it.

Until I got a timer and got it to reboot every day at 3 am (the https://xkcd.com/1495 route) and never had the problem again.


Rebooting a router is not always an option.

Both examples are from "public" wifi where the person with the problem is unlikely to be the only user.


Probably a 5 minute downtime at 4 AM is overall preferable to a continuously unstable connection. Of course an entity like a hotel should just pony up the money for better hardware, but I could excuse that for an (Air)BnB or guest house or vacation apartment.


And for guest houses etc the default password device is often unchanged you could always login and reboot for yourself.


Was a 2% drop really that noticeable?


I think OP is saying it dropped to 2% from 100%. If you remove the "about" and think about it as an estimate, and focus on the "to"... it makes more sense: "Dropping my speed to ~2% of what it was initially".


Yes, this.

My download speed, measured after a reboot, was 50mbps. It would then drop to ~1mbps after a few days and stay there.


Here's a theory based on a bunch of un-substantiated and un-researched facts:

Some routers do not allow traffic from IPs which are not active DHCP leases. When DHCP lease expires, they block traffic. But many routers don't enforce this.

Some routers give out super short DHCP leases - as short as 30 seconds. But the typical length is 12 or 24 hours.

A public or semi-shared wifi router is more likely to use these policies to prevent exhausting the local subnet address pool, which is typically around 254 addresses.

Mac OS X doesn't believe that renewing a DHCP lease in 30 seconds is ever really necessary, limits the renewal frequency to once per minute.

I'd check the dhcp lease time.


An intriguing and plausible theory! In my case though it seems that the lease is 86400 seconds (24 hours), or at least that's what the router is claiming to give out. So I don't think this is it. My guess is that renewing the DHCP lease has some side effect of resetting the session, rather than it being the actual DHCP lease renewal that is important.


Are you serious? A 30 second lease is insane.

I've seen DHCP servers that TAKE 60 seconds to give a lease.


Was that the server, or the client waiting that long? THe DHCP spec requires the client to broadcast a request, then wait for a while (unspecified) and collect answers, then choose the best answer (metric unspecified).

Sometimes when I write DHCP clients I choose the metric as "fastest", then just choose the first answer and don't wait for any more. And it takes on average a few milliseconds to get a lease.


It wasn't the client, but beyond that honestly I don't know, I didn't get into the nitty gritty of it. Could have been a terrible or overloaded piece of server hardware, or maybe a network loop or something else misbehaving between the client and server.

We built a portable device that starts a DHCP client and tries to get a lease on an Ethernet plug event, and if no lease is acquired after a certain amount of time, the device assumes it has been plugged directly to a PC (or an "ad-hoc", isolated network with a switch and no router) and will kill the client and start a DHCP server itself, so you can access the device's webserver directly. I wasn't onsite, but the solution was simple - we just increased the timeout to wait for a lease before starting the server.


Depending on what switch you're plugging into, this could easily be Cisco Portfast or similar catching you out.

When the port goes active (layer 2 link) the switch inspects but doesn't forward packets to try and ensure you've not just added a loop into your ethernet network. After a minute or so, if everything looks normal, the switch then starts letting the device talk to the rest of the network.

If you don't know about it then this can be annoying at best as you seem to have a layer2 link but DHCP can seem to take an age to start working :)


I'd note that DHCP broadcasts are unreliable and could simply be dropped. Then the client is responsible for retrying them.

Except that packets don't really get dropped for wire errors at all anymore. Not unless the bandwidth is oversubscribed and there's a collision (really a buffer overflow on a router).


Not sure how it works on Linux, but on Windows, if the acquired DHCP IP has a lease time of 60 minutes, then Windows will try to renew that IP anytime after 1/2 the lease time (i.e. 30 minutes). This is why in Corporate environments good practice is to keep lease time 1 hour or more.


Definitely unsubstantiated: If this is just at an AirBnB, it's probably not going to be a commercial wifi system.

On your topic, though, Princeton noticed something similar with regards to iOS and DHCP, although it is several years old now: https://www.net.princeton.edu/apple-ios/ios41-allows-lease-t... (last tested version was iOS 6.1)


I wish he would drop the name of the router he's running into these issues on. I have the same exact issue with my laptop (random disconnects for 30 seconds) only on my home router, and I've been considering replacing the router with something of higher quality.

I use an ASUS 750N, which is starting to show its age faster than I expected, but otherwise services the devices in my house just fine. Except for my laptop, which is the only device I actually need to use over wifi on a regular basis.


I don't think the model is 750N, more likely that's the max speed. Probably you have the 56u or 66u. The Merlin firmware for Asus routers has really improved my experience with the ac-87u. It keeps the stock UI and fixes bugs/adds features. https://asuswrt.lostrealm.ca


asuswrt-merlin is awesome. I replaced my ISP's shitty combo router with a barebones TP-Link DSL modem and an Asus AC5300 (which physically looks ridiculous) running asuswrt-merlin. It turned one of the more annoying parts of my day into, eh, nothing. It just works flawlessly for months on end.


You're quite correct, the 750N was from memory. The actual model is an RT-N65R.

That firmware looks lovely! I will give that a whirl when I have time to dink around with it, possibly this weekend. Thanks for that!


Happy to help out :)


For what its worth some hotels have been known to intentionally degrade network performance / access to push people to the hotel supplied service. That has included screwing up people trying to stream video into their room via the network rather than buying the PPV movies that the Hotel provides. Either of these failure modes have a good chance of degrading the viewing experience of streaming video sufficiently to make it unusable.

If you are in a hospitality situation you can demand a refund on your Internet charges, or if the Internet is "free" consider asking for a discount on your bill.

That said, power line networking (the author mentioned devolo) is notoriously fraught with challenges. A company I helped start was acquired by the folks who put power line networking on the map (Tut Systems) and their were a lot of interesting interference sources they tried to mitigate. Inductive loads (motors) coming on and off was a big challenge, and places like hotels would have large fans that would circulate air through the common areas or hallways. Fluorescent lights were another noise source.

All in all it was a poor excuse for a network and everyone was amazed they got a megabit per second point to point through it reasonably reliably.


My own Linksys router (I think it's a WRV200) occasionally starts to silently drop new connections until I disconnect and reconnect. If my browser has an http/2 connection open, I can continue to browse that particular site, but when I navigate to another site, the browser times out. It seems to only affect one device at a time.

Curiously, the behavior became dramatically more frequent (from once every few months to several times a week) when I moved in with family, which meant plugging it into a different modem, and adding a bunch of Windows devices to the network.

(When I move back out, I'll probably leave this router behind and get myself something that runs Tomato.)


That sounds like the NAT table has filled up.

Incidentally, this is often the cause of problems with "routers" that people experience --- it's not the (IP) routing part that's giving trouble, as that's stateless and involves not much more than packet forwarding. Things like NAT do involve state, since the router has to assign a port mapping and keep track of (TCP) connection lifetimes. If connections aren't closed correctly or the router misses detecting them for whatever reason, the port mappings will stay in the table until they time out (which may take hours or more) or the "router" is rebooted. The problem becomes more frequent the more devices there are on the network which are making connections.


Every free wifi I have ever been on has been broken like this in one way or another, crazy ping times, randomly dropped packets or just extremely slow etc. In fact I am writing this right now from an Amtrak train but tethered to my phone since the free wifi is so bad.


If it's truly free (and often even otherwise) there's probably a dozen or more others sharing the same access point as you.


I think Amtrak uses cellular connections to the outside internet. Depending upon the part of the country, there is a good chance that the weak point is their link to the cellular network.


It seemed like Amtrak uses Verizon for backhaul, at least on the Lake Shore Limited. It definitely goes through some spotty coverage areas.

Personally I couldn't care less about their wifi though - I appreciate the power outlets much more.


MacBooks have issues with WMM (WiFi QoS). I've seen my connection drop for dozens of seconds at a time when I enable WMM on my router. Selectively disabling it for my Mac (leaving it on for my wife's Lenovo and my HTPC) fixes the issue.


Thanks for tip. Just turned it off. Hopefully it will solve the issues which I have just on MacBook, iPhone works OK.


The details are vague but if I remember correctly, this is due to something OS X does with DHCP that isn't quite standard but apparently helps to make connection resumption faster --- when it works, that is.


I seem to remember this. Was it that OS X started using its old lease while waiting for it to renew under the assumption that it would probably get the same thing and be fine?


That's almost exactly the case.

Link here: http://cafbit.com/entry/rapid_dhcp_or_how_do

Previous HN discussion here: https://news.ycombinator.com/item?id=2755461


These aren't even that bad.

I've had a router start sending me two DHCP offers, with different IPs. The first offer containing an IP outside the range it was supposed to allocate. After my computer tried to accept the first offer, the router started ignoring me.

This behavior survived rebooting the router (and everything else involved), flushing the DHCP table fixed it for some reason.


I have a TP-Link Archer D9 on the "latest" firmware that will either:

1) stop giving out leases

2) suffer catastrophic failure in miniupnpd and stop allowing port forwarding

This is a $200AUD device and it's utter trash.


I had exactly the same problem with OSX - all other devices were fine but my macbook would drop packets after 30 seconds or so. Only fix seemed to be a new router. No problems since.


Interesting! I looked quite a bit to see if there was anything OSX-specific that anyone else online had documented, but nothing I turned up seemed to pan out. I am reasonably certain that the ultimate culprit here is bad low-end wifi devices that either suffer from bad hardware (too little RAM, etc.) or bad software (some poorly tested customized version of embedded Linux), or both. But I'm really curious why only OSX seems to cause the problem to manifest in several of these cases.


Your article doesn't mention it: Have you tried using a statically assigned IP address? (Poor etiquette inside the DHCP range, but maybe if you just snarf the IP you've already leased..)

I ask this because OS X's DHCP stack is known to exhibit some strange and unfriendly behaviours: http://cafbit.com/entry/rapid_dhcp_or_how_do


I did try that (though you're right that it's not in the article), setting the last DHCP address I'd received as static IP, but it doesn't seem to change anything. Still the same dropouts every ~30 seconds.


Maybe try force sending gratuitous ARP's every 10 seconds?

Also it might be interesting looking at a packet capture from both android and mac laptop to see what the difference is.


Hi mjn! I am living in the UK at the moment, and I'm occasionally experiencing the very same issue. My OSX device randomly drops packets (for 30s at a time), while my Android phone has no issues. I am connecting to the internet via a broadband wifi router.

I tried rebooting the router, but that does not always resolve the problem. However, when I reboot the Mac, the problem goes away for at least a couple of days.


All UPC spplied routers in Austria have that issue. Does not matter whichbwifi is connected. Thr only way I could fix his is to put a second NAT up and have it make a new dhcp server below it. It's a well know problem that neother apple nor tompson are fixing :(


UPC customer here as well, good to find I am not alone. Has this been reported to them?


I talked to the Austrian version of it and they seem useless. I don't think they understand there is a problem but as far as I understand it's the same story in other countries as well.


I've found that in 90% of the cases where packets are dropped randomly on a consumer-level access point, the automatic channel selection is to blame. Go to settings, select a static channel number, and the problem disappears completely.


In my case it doesn't seem to be the wifi side of things, but the routing side of things that's causing trouble. I can consistently ping the local AP itself, but the connection to the internet has periodic dropouts.


90% is probably too generous. I tried static channel selection on two wifi routers with the same issues described here and it does not fix the issue.


I used to frequent a coffee shop with a router that would refuse to let my laptop's wifi card connect about half the time. Like the author, my phone could connect but it would simply not respond to any attempts to connect from my laptop, whether booted into Windows or OpenBSD. The only solution was to ask the barista to reset the router, which was all the more frustrating when they refused to do it because everyone else could connect just fine.


That seems like the readonable answer on the part of the barista.


I had a similar issue to this some time ago. I upgraded our internet at home to Comcast's 250 megabit package, in doing so I had to upgrade our modem. I picked up a Motorola/Arris Surfboard 6183

I ended up with an interesting issue where the modem itself would "hang". My ping to Google would skyrocket from 5ms to 3000ms+, my download speed would drop from 250~ megabits to 0.20, the upload speed however would remain constant at about 30Mbps

Rebooting anything and everything had no effect, throwing various kinds of routing equipment in front of the modem (from OpenWRT to PF sense to even OpenBSD) made no difference. Eventually I purchased a "KanKun SmartSwitch". Some cheap Chinese wall adapter that you could power devices on and off using a phone app. Conveniently it ran OpenWRT and was hackable, so I was able to automate simply rebooting the modem when pings spiked into the 4 digit range (pinging Google every 60 seconds with a simple bash script)

I spent months talking to various Comcast departments. Technical Support, Tier 2, Tier 3, NOC, Headend, Engineering. Eventually I was told that (across the 14 states in my "division") there's only about 20,000 of the model of modem I had, simply not enough to be able to establish a problematic pattern (presumably most customers would just reboot the device when it crashed

Frustrated, I noticed on the Amazon store page that there were numerous complaints about the device with the same issue. I assumed it most likely to be a firmware issue, as the headend engineering team could not correlate any changes from my device on the node at the timestamps I gave them, still archived here http://intelminer.com/reboot.txt

Upon speaking to Amazon about a refund or exchange, they referred me to Arris/Motorola citing it was under warranty. Arris/Motorola then "helpfully" explained that they certify everything BUT the software of the device to be functional, as such I was not covered under warranty. (But hey why not buy a Surfboard 6190 instead? it IS newer!)

It seems like almost a racket for planned obsolescence. Release a decent modem, hire some interns to write crap software, then encourage the customer to "upgrade" the "faulty" hardware when they call in. After all, it's not under warranty now is it?


> "helpfully" explained that they certify everything BUT the software of the device to be functional

Perhaps because your ISP is responsible for providing and flashing the modem's firmware? That being said, I don't exactly trust Comcast to provide reliable software for my modem.


The firmware is maintained by the OEM and given to the ISP for distribution. The ISP does not specifically write (or maintain) the firmware used for customer owned equipment


Check out the Mikrotik routers www.routerboard.com these are linux based and rock solid. The HAp AC Lite retails for $50 https://routerboard.com/RB952Ui-5ac2nD

Also these routers are like swiss army knives they can do everything from DHCP Server to full BGP and MPLS. The only limitation is the CPU, Ram and the Interfaces.


Same problems for me, experienced this a lot on my home wifi, using a dual band, modem / wifi router combo. Problems ONLY occurred on my MacBook Pro, no other devices.

Purchasing a dedicated modem and using an Apple AirPort Extreme was the only thing that resolved the issue, which annoyed me and made me feel like Apple is further locking me into their ecosystem of devices.


MBPs don't support certain channels which is maddeningly frustrating. Trying to find from my notes which channels those are, but when I finally changed from auto to hard-setting to channel 100, all my issues with MBPs on my network went away.


There are certain channels in the 5Ghz range that require the access point to do checking to make sure they don't actively interfere with radar...

See here for a document from Cisco describing the issue:

http://www.cisco.com/c/en/us/td/docs/routers/access/3200/sof...

interestingly enough, hard setting the channel to 100 might violate FCC regulations, and DFS should still be used...


There isn't radar within 100 miles of me so I'm not too concerned. Regardless, this AP firmware won't let you hard set to channels the FCC requires DFS on.


Channel 100 requires DFS... so you did set it to a channel that requires DFS.


Can you please find the list? I think there are just 13 channels (or 11 depends where you are) 2.4Ghz


Still can't find it. I haven't had issues on 2.4ghz, this is on 5ghz. This thread seems to cover about the same thing I discovered:

https://www.reddit.com/r/apple/comments/2zo1uo/if_you_have_s...


I'll have to give that a go. My MBP performs poorly on dual-channel WiFi


I experience basically the same level of problems on our office's Cisco Meraki gateway/firewall. A couple months ago we had to do a support ticket to get them to fix their spanning-tree implementation because it configured loops in our simple dual-master crossover switch configuration, forcing us to disable one of the masters. Then there was a spurious packet loss problem in one of the default configuration parameters.

Now it just randomly stalls TCP connections open until the local node closes them, but allows the remote node to continue receiving packets(!), which is probably the worst thing a TCP system can do aside from simply not connecting.

I quite literally had to attempt this post twice before giving up and connecting to a UDP VPN. I have watched my colleagues cancel and refresh webpage loads for the last few months.


I've had all sorts of problems with Meraki APs. They claim to hop around to find the clearest channel, but they all always chose channel 44. Manual channel selection was limited to non DFS channels, otherwise they'd switch back to channel 44. There was no insight into CPU usage, so we were suspicious that some QoS settings made everything grind to a halt, but we couldn't confirm. Switched to high end Ruckus APs and a Juniper router/firewall and it's now working properly.


I dunno about the first, but the second looks oddly reminiscent of when i had a N800 and the router at home didn't have a first clue about wifi power saving.

IIRC how wifi does it is that the device signals the router and then shuts down the radio for some 100s of ms. This indicated to the router to hold the packets.

Now if the router do not have a clue about the signal, it would likely treat the device as gone. So when it comes back on and expects to continue from where it left off, the router gets royally confused.

Not sure why doing a dhcp request would fix it though, but then i have not the first clue about OSX innards.


My router stops working about once a day, but only if Apple products are connected (not sure yet if iPhone or macbook (they often come in pairs), but it's consistent across 4 flatmates).




Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: