Re: MSM760 with MSM410 and MSM430 Problems

February 25, 2016, 1:23 am

≫ Next: Re: MSM760: APs are slow on getting synchronized after controller reboot

≪ Previous: HP MSM firmware upgrade paths. Why is this not prominently documented? What is the recommended path?

I can't really help you with the AP weirdness, but what the port setup you need is depends on your needs.

If you don't want the controller NATing your traffic, you'll do fine with just the LAN port plugged in (that's how ours works). Having the same IP on two ports is probably going to be a bad idea (your client AP ARP tables are going to be a mess, and that might well explain some of the weirdness).

So long as you're confident your LAN port etc. is correctly configured, with whatever VLANs etc. it might need, try simply unplugging the WAN port. Some of the APs may lose connectivity until they get back on track with ARP (or the routers they go through if they're not connected to the controller at L2 "fix" their ARP entries). Power cycling them or factory resetting them with a paperclip might sort it out faster.

↧

Re: MSM760: APs are slow on getting synchronized after controller reboot

February 25, 2016, 1:29 am

≫ Next: Re: MSM760: APs are slow on getting synchronized after controller reboot

≪ Previous: Re: MSM760 with MSM410 and MSM430 Problems

I can confirm that APs take a troublingly long time to come online again - particularly if you change firmware versions; you can have a couple of hours of downtime across a larger site (we have over 70 APs, which suggests wireless is important to us) before things work again.

It's not really good enough, and I struggle to understand why pushing out firmware "updates" and "updating configurations" takes as long as it does.

↧

Re: MSM760: APs are slow on getting synchronized after controller reboot

February 26, 2016, 2:27 am

≫ Next: Re: Losing access to network

≪ Previous: Re: MSM760: APs are slow on getting synchronized after controller reboot

How is your access points configured to reach the controller?

Are the APs static ip address or dhcp?

Manually set or via dhcp or all aps and controller on the same vlan?

Are the APs Discovery set to the controller ip address?

I have 4x controllers and 400x APs and software upgrade from 6.5.0.0 to 6.2.0.0 took 30 minutes for upgrade, firmware flash to AP and syncronize.

↧

Re: Losing access to network

February 26, 2016, 11:19 am

≫ Next: Re: Losing access to network

≪ Previous: Re: MSM760: APs are slow on getting synchronized after controller reboot

I also am having this issue. I have read through the 6 pages of blog, and find that I am not alone. I have an MSM410, with NO controller, a stand alone unit. I was running Firmware 5.5 for about 3 years with NO problems. (Same clients, same hardware connecting and using the AP, about 20 devices). After 3 years, I decided to upgrade the firmware, so that I could connect with browser (SSL v3 not supported on newer OSes) and manage the device. Bad move! I upgraded about 10 days ago, and every 24 - 48 hours, I get called that our WiFi is dead. I log into it, and it seems like it is up and running, but the people using just lost connectivity. I restart (from web panel or PoE cable disconnect) cures the problem and life goes on.

There is only 1 VSC and wireless security is NOT checked.

Has anyone been able to make this work. Firmware is current at 6.6.2.0-22792

↧

Re: Losing access to network

February 29, 2016, 1:23 am

≫ Next: Re: Losing access to network

≪ Previous: Re: Losing access to network

Hello,

I'm having some big throughput issues. Recently we added a 5th controller to our team with 100 extra APs, having a total of ~760 APs and peaks of 4000 concurrent users. On the weekends it seems to be fine, we have a openwrt client making measures all the time to see how the throughput/latency and so on is working. The problem comes as soon as the number of users arise.

When there are about 1.5-2K simultaneous users connected, the manager controller CPU stays continuosly at 90-100% usage until the number of users drop, then the CPU lowers (it still has some spikes though),

From what I gathered the culprit seems to be a process called rrdsampler, it hogs the CPU and it is affecting the service. It is affecting the authentication process as well, I noticed that we have a ton more 802.1x timeouts than before, the throughput drops drastically and ping loss and latency increases. That happens on an AP without many users and the total throughput of the AP on the ethernet port is very low.

There are no big interferences detected, I went there with a spectrum analyzer to check if it could be an RF issue but I didn't find any problems, just a nearby AP that was on a different channel so no channel overlap there (5 channels of difference between them).

I know that RRDtool is used for graphing and storing statistics, maybe the issue here is trying to get too many statistics from each and everyone of the users. When there are few users it's ok, but when that value spikes it's just not working.

We are running 6.6.2.0, we have many 3 VSCs, 2 of them are tunneled through the controllers but the third one is not tunneled (sends the traffic directly from the AP to a VLAN tagged directly onto it). We are not using the team for control access, just for authentication through an external RADIUS server.

Our configuration is like this:

- We have the lower allowed speed rates disabled (11Mbps or higher are only allowed) to assure a good connection for each user.

- RRM enabled with auto-channel, auto-power and AP load balancing.

- Tx protection -> RTS/CTS with 1024 RTS Threshold to mitigate the hidden node problem (we took measures to see if this affected the overall throughput and it didn't seem affect that much).

I already opened a case with support but I would like to know if someone is experiencing the same issues I'm having. Mostly the rrdsampler process issue, if you want to check whether the process is hogging the CPU SSH the controller/AP and type top.

Aarón

Thanks!

Aarón

↧

Re: Losing access to network

February 29, 2016, 2:20 am

≫ Next: Re: Losing access to network

≪ Previous: Re: Losing access to network

Hi Aaron,

Do you have LLDP enabled?

Disabling LLDP dropped our CPU usage from 90-100% to 27-60%.

I have 4x MSM760 teamed with 400X MSM460 APs on software ver 6.6.2.0.

Regards,

Craig.

↧

Re: Losing access to network

February 29, 2016, 2:36 am

≫ Next: Re: Losing access to network

≪ Previous: Re: Losing access to network

No I haven't, I will try that and let you know how it goes, thanks!

↧

Re: Losing access to network

February 29, 2016, 3:18 am

≫ Next: Re: Losing access to network

≪ Previous: Re: Losing access to network

I just disabled LLDP but the CPU it's still very high, here is the top output command:

Mem: 2244736K used, 862380K free, 0K shrd, 315404K buff, 722896K cached
Load average: 3.51, 3.73, 3.70    (State: S=sleeping R=running, W=waiting)

  PID USER     STATUS   RSS  PPID %CPU %MEM COMMAND
25481 root     R        15M   449 91.5  0.5 rrdsampler ---> That is the process that hogs the CPU
 5853 root     R       141M   449 19.1  4.6 rfmgr_sc
  478 root     S       736M   449  9.5 24.2 regng
  815 root     S       3088   449  4.0  0.0 openvpn_master
  728 root     S        36M   449  2.7  1.2 openvpn
  452 root     S       6228   449  2.7  0.2 store-devices

Also here is a screenshot where you can see how the users increase (top graph), the manager controllers CPU increases as well (middle graph) and the bandwidth report deacreases (last graph). It happens everyday except on the weekends, where I the bandwidth was far better and consistent.

I added a second graph but with a week timespan where the you can see the behaviour I mean.

Any ideas? When I get some more info I'll keep on posting plus with any news from support.

Regards,

Aaron

↧

Re: Losing access to network

February 29, 2016, 3:37 am

≫ Next: Re: Losing access to network

≪ Previous: Re: Losing access to network

Another setting may be IGMP proxy under Home -> Network -> IGMP.
Have seen it mentioned together with high CPU.

Also, Radius accounting really adds a lot of processing. Disable it if you do not use it on the VSC.

Perhaps disable RRM to test

Check if you have severe interference checks enabled on the radios - also made my CPU usage higher.

Will post again if I think of anything else.

Regards,

Craig.

↧

Re: Losing access to network

February 29, 2016, 4:02 am

≫ Next: Re: Losing access to network

≪ Previous: Re: Losing access to network

Hello Craig,

this is how we have it configured:

- IGMP is disabled

- We need RADIUS for authentication

- We need RRM enabled as well so the APs are assigned to their channels in a way that make sense, we had issues before using it due RF interference between or APs.

- Severe interferences is disabled, we had a case prior to this one that showed that the APs were hopping between channels constantly and prevented RRM from running as those APs weren't on a "stable condition".

I still think it's just a graphing problem, let's see what does support say about it.

Regards,

Aarón

↧

Re: Losing access to network

February 29, 2016, 4:18 am

≫ Next: Re: Losing access to network

≪ Previous: Re: Losing access to network

Hi Aaron,

Radius authentication does not need accounting except if you use it for some kind of bandwidth limit etc.
I disabled radius accounting and users are still happily authenticating. ;-D

I also enabled RRM for auto channel and auto power, but then switched it off because the environment should not change all the time. You can perhaps disable it and run it manually once a week/month.

I would like to know what HP support says.

Hope you win.

Regards,

Craig

↧

Re: Losing access to network

February 29, 2016, 5:41 am

≫ Next: 3 MSM430 APs failing to update to firmware version 6.6.0.0 from version 6.4.2.0

≪ Previous: Re: Losing access to network

Hi Craig,

my bad, I read your prior post incorrectly, we only do authentication with a remote radius, no accounting whatsoever.

We have RRM enabled it so it runs automatically every night at 5.00AM, I'll wait for HPs response, maybe it would be better to not run it every single day, just once a week. I'll wait to see what they say.

As soon as I get any news on the case I'll post back.

Thanks again!

Aarón

PS: I hope I win too :D

↧

3 MSM430 APs failing to update to firmware version 6.6.0.0 from version 6.4.2.0

March 2, 2016, 1:57 pm

≫ Next: Re: MSM 760 upgrade from 5.7.3 to 6.x?

≪ Previous: Re: Losing access to network

We have a MSM720 controller which i just updated to the 6.6.0.0 firmware version. 12 of our MSM430 APs updated correctly. 3 of them at a remote office don't seem to be taking the update. The controller tries to upload the firmware but the APs just disconnect after a few minutes. This process repeats itself about every 30 minutes.

Any ideas as to what may be happening? I have updated the firmware on these units before with no problems.

Would i loose my config if I downgraded back down to version 6.4.2.0?

Thanks in advance for any advice.

↧

Re: MSM 760 upgrade from 5.7.3 to 6.x?

March 3, 2016, 11:33 am

≫ Next: Re: MSM 760 upgrade from 5.7.3 to 6.x?

≪ Previous: 3 MSM430 APs failing to update to firmware version 6.6.0.0 from version 6.4.2.0

Is it alright to do the direct jump from 5.7.3 to 6.5.3 or are there incremental jumps that should be done?

Thanks.

↧

Re: MSM 760 upgrade from 5.7.3 to 6.x?

March 4, 2016, 5:57 am

≫ Next: Re: 3 MSM430 APs failing to update to firmware version 6.6.0.0 from version 6.4.2.0

≪ Previous: Re: MSM 760 upgrade from 5.7.3 to 6.x?

You need to upgrade to 6.2.1.1 first.

↧

Re: 3 MSM430 APs failing to update to firmware version 6.6.0.0 from version 6.4.2.0

March 4, 2016, 7:12 am

≫ Next: Re: HP MSM firmware upgrade paths. Why is this not prominently documented? What is the recommended p

≪ Previous: Re: MSM 760 upgrade from 5.7.3 to 6.x?

Yes, you would loose config if you downgrade back to 6.4.2.0 from 6.6.0.0. Except for one scenario, if you were previously running 6.4.2.0 prior to upgrade to 6.6.0.0 then you wont loose config. As it would restore the config from the previous snapshot which would have been captured as part of the upgrade.

For remote APs not taking the firmware, are you able to confirm if they show up as "uploading firmware" then goes back to "resetting config --> pending --> Waiting for acceptance --> validating firmware"?

If the remtoe office over WAN link like MPLS, then there are good chance of somewhere MTU limiting the data packets coming from controller to AP. You might want to check if thats not the case.

Alternatively, if you want to update each of the MSM430 to 6.6.0.0 in an isoalted environment prior to attaching them back to controller environment. You can download the standalone image for MSM430 from the support HPE Networking support portal - https://h10145.www1.hpe.com/downloads/SoftwareReleases.aspx?ProductNumber=J9651A

↧

Re: HP MSM firmware upgrade paths. Why is this not prominently documented? What is the recommended p

March 4, 2016, 7:19 am

≫ Next: Re: MSM765 team APs reloaded

≪ Previous: Re: 3 MSM430 APs failing to update to firmware version 6.6.0.0 from version 6.4.2.0

Yes, you should be good to proceed with 5.5.x --> 5.7.5.0 --> 6.6.3.0.

↧

Re: MSM765 team APs reloaded

March 4, 2016, 7:22 am

≫ Next: Re: Teamed MSM760 Mobility Controllers synchronization issue / AP detection problem

≪ Previous: Re: HP MSM firmware upgrade paths. Why is this not prominently documented? What is the recommended p

Too little information to comment anything on this.. any logs or errors? screen snaps of what exactly is happening?

↧

Re: Teamed MSM760 Mobility Controllers synchronization issue / AP detection problem

March 4, 2016, 7:30 am

≫ Next: Re: MSM 760 upgrade from 5.7.3 to 6.x?

≪ Previous: Re: MSM765 team APs reloaded

From what you have mentioned, it almost appears to me that due to TEAMING license will be shared by all the 4 controllers included in the team. If for example for the 1st 2x controller had 150 each license and next 2x controller had 100 each. Total avaialble license becomes 500. Once all 4x controllers are in teaming, then each controller would have 125 license.

So if your APs were provided with DHCP option 43 to include all the 4x controllers. Then the APs would only be accepted based on each controller limit. In this case 125 APs per controller. Anyting beyond that 125 count, APs will not be accepted by any controller.

BUT if you reboot any one of the controller then the license pool will be re-calculated. Now 500 license / 3 controller == approx 166 APs per controller. So when you reboot any of the controller APs can be accepted by controller beyond the 125 limit upto the new limit of 166.

Hope this helps.

↧

Re: MSM 760 upgrade from 5.7.3 to 6.x?

March 4, 2016, 8:01 am

≫ Next: Re: Teaming two MSM 760 problem

≪ Previous: Re: Teamed MSM760 Mobility Controllers synchronization issue / AP detection problem

Thank you Merminger. This morning I took it from 5.7.3 to 5.7.7 then up to 6.2.1.2 then finally up to 6.5.3.0. Everything seems to be happy and I can finally start testing our new HP560 APs.

Thanks for the response.

↧