Monthly Archives: January 2017

When meet Issue about RRPP ring status on S7700 .

Issue Description

1. Topology

Topology
2. After configured, the sub ring status is preforwarding on S7700
RRPP Ring      : 2
Ring Level     : 1
Node Mode    : Edge Transit
Ring State     : PreForwarding
Is Enabled     : Enable                             Is Active: Yes
Primary port   : Ethernet2/0/2                 Port status: UP
Ethernet2/0/3                 Port status: UP
Secondary port : Ethernet2/0/5              Port status: BLOCKED

Alarm Information

None

Handling Process

Check all configuration on device. Found that RRPP working-mode is GB on S7700, but on S57 it is HW mode. Because of different mode, there are different node and configuration on Switch.
Let customer change RRPP working-mode to HW same with S5700 and test it works fine.

Root Cause

1. Wrong Configuration
2. RRPP calculation Issue

Suggestions

1. Changing RRPP working-mode needs to disable RRPP first and delete the old configuration
2. On S5700, RRPP only supports HW  working-mode and cannot be configured. S7700 supports both modes.

 

TwitterLinkedInGoogle+FacebookPinterestTumblrStumbleUponRedditShare

How to check the ifindex of an interface on the switch?

Issue Description

There are situations where we need to find out the corespondence between the ifindex and the interface names. For instance, if we receive a log on the switch that is refereing to a ifindex number but does not mention the exact interface name, we would need to know this corelation in order to indentify where the problem resides.

Example:

Feb  9 2016 08:47:12 xxxx %%01IFNET/4/BWRATE_OUT_RESUME(l)[0]:Interface output flow bandwidth usage was restored to the log threshold. (Interface=10, BandWidthUsage=85, LogThreshold=90)

Solution

To indentify the interface index of the interfaces, I can suggest the following solutions:

–Ru-Run the display rm interface command to check the ifindex of the intenterface directly in the CLI . By running this command the ifindex is shown in hex and you would need to convert it to decimal :

Example:

[R6_U11_S5710]display rm interface

Name: GigabitEthernet0/0/5

Physical IF Info:

IfnetIndex: 0xA         //  convert the hex value to decimal to obtain the ifindex (0xA = 10 )

State: DOWN BCA MULT

Hardware Address: F84A-BFF0-2E70

Slot: 0(Logic Slot: 9)

IntType: 36, PriLog: 0, MTU: 1500, Reference Count: 1

Bandwidth: 0, 100000000

Baudrate: 0, 100000000

Delay: 0, Reliability: 0, Load: 0

LDP-ISIS sync capability: disabled

LDP-OSPF sync capability: disabled

InstanceID: 0, Instance Name: Public

Age: 88548sec

-Use the below public OIDs to query the name and the ifindexes of the interfaces

ifName is Oid .1.3.6.1.2.1.31.1.1.1.1 and ifIndex is oid .1.3.6.1.2.1.2.2.1.1

When cannot ping the server through S5700HI VLL network.

Issue Description

1. Topology
Client — L3VPN — NE40E(PE) —- S5700HI-1(P) — S5700HI(PE) —– Access Switch — Server

2. Problem description
Configure VLL service between NE40E and S5700HI. Found some servers can ping and some cannot ping from client.

Alarm Information

None

Handling Process

1. Ping Server from access switch and it is ok.
2. Ping from NE40E and it is failed. Make traffic statistics on access switch to check icmp pakcet reach or not.
Result: No icmp packet reach
3. Check that there is no ARP table on NE40E. So the problem is ARP learning issue. Make ARP pakcets statistics on access switch. No arp request arrives.
4. on S5700HI P devices, capture packets on the port connects to NE40E. And found there is no arp packet.
5. Check the port configuration on NE40E. Customer configure QinQ sub interfaces. One command “arp broadcast enable” is missed.
interface Virtual-Ethernet1/1/1.100
control-vid 310 qinq-termination
qinq termination pe-vid 1002 ce-vid 310
ip binding vpn-instance l3v-sermgmt
ip address X.X.X.X 255.255.255.128
arp broadcast enable

Root Cause

1.Configure Issue
2. Network Issue

Suggestions

For sub interface and QinQ sub interface, command “arp broadcast enable” must be configured. Or device cannot send out the arp packets.

 

How to add an ethernet interface on Solaris Operating Syster?

Issue Description

Q:

How to add the new ethernet interface on Solaris Operating system?

Alarm Information

Null

Handling Process

A:

There are some steps to do this configuration in the Solaris base system.
Step 1:-Login to Operating system as a root user.
Step 2:-
Find the network interface by using the following command.
bash-2.05$ more /etc/path_to_inst  | grep net
“/pci@1f,700000/network@2″ 0 “bge”
“/pci@1f,700000/network@2,1″ 1 “bge”
“/pci@1e,600000/network@2″ 0 “ce”
“/pci@1d,700000/network@2″ 2 “bge”
“/pci@1d,700000/network@2,1″ 3 “bge”
bash-2.05$
Suppose you want to configure the ce0 interface.
#ifconfig ce0 plumb
Step 3:-
Set the ip address and subnet mask of the interface.
#ifconfig ce0 inet  <ipaddress> netmask <subnet mask>
Step 4:-
make the interface up.
#ifconfig ce0 up
Step 5:
Create/add the entry of ip address in the hostname file
#vi /etc/hostname.ce0
Add the ip address of ce0 /etc/hostname.ce0
<ip address of ce0>
Step 6:-
Add the entry of ip address in the /etc/hosts file.
#vi /etc/hosts
Add the following line in the file.
<ce0 ip address>    ce0
Step 7:-
Add the entry of subnet mask in the /etc/netmask file.
#vi /etc/netmask
Add the following line in the file.
<ce0 network>    <subnet mask>
Step 8:
Add the static route for the new interface network.
Permanent route.
-#cd /etc/rc2.d
#vi Sstaticroute
Add the following line in the file.
#route add -net <ce0 network> <gatway IP address>
Step 9:-
Temporary route.
#route add <ce0 network> <gatway IP address>

Root Cause

Adding the ethernet interface for expension/new netwrok expenssion.

Suggestions

No need to restart the server after adding the ethernet interface on Solaris based system.

 

If 5700 stack member fall out caused by A large number of MAC entries delete.

Issue Description

When the issue happened, S5700 stack member cannot be accessed intermittently. Status information about this stack member cannot be obtained through commands and this fault cannot be automatically rectified. After powered off and restarted the stack member, the fault was disappeared.
The command output showed that information about a stack member cannot be obtained. The following uses the display environment command as an example.

S5700

S5700

The preceding command output shows that temperature information about all stack members except the device with SlotID4 can be obtained normally. That is, obtaining temperature information about the stack member with slot ID 4 failed.

Alarm Information

None

Handling Process

1. Check the process of obtaining stack member status information in a stack.
In an S5700SI stack, the master switch obtains status information through Remote Process Call (RPC), and stack members exchange data by sending Interprocess Communication (IPC) messages. Because temperature information about a stack member cannot be obtained, a fault occurred during RPC invoking. RPC uses IPC messages to exchange information, so the IPC message exchange process may be abnormal.

2. Analyze the IPC processing flow.
When the stack member was restarted, the software was re-initialized, and the fault was rectified. Therefore, an error occurred during software processing. Additionally, powering off and restarting the stack member can rectify the fault, indicating that the fault occurred on the stack member.
View message queue statistics on the master switch.

S5700

S5700

The preceding command output shows that messages were accumulated in VLAN, L2MA, and CXQO queues. The L2MA message queue (MAC synchronization task message queue) was full of messages, indicating that the IPC tasks of stack members were suspended and cannot process IPC messages. As a result, messages were accumulated on the master switch.

4. Analyze the reason for IPC task suspension.
Because the fault occurred on a stack member, we checked the black box of the stack member.

S5700

S5700

The preceding command output shows that an infinite loop existed. Detailed information about the infinite loop is as follows:

[s5700_ST_5ET-diagnose]display deadloop 20 slot 4

============ Task Infinite Loop Information Begin ============
Dopra Version                    = DOPRA V100R006C09CP0671
Application Version              = UnConfig
Task Infinite Loop Type          = Task overrun
Task Infinite Loop Handle        = Reset system
Task Infinite Loop CpuId         = 0
Overrun Task Name                = DELM
Overrun Task VOS ID              = 21
Overrun Task Osal ID             = 0×06299840
Task Overrun Threshold           = 30000 (ms)
Task Has-run Time                = 30000 (ms)
Task Infinite Loop Occur Time    = [2014.05.28  18:14:02]
Task Infinite Loop Occur Cputick = [0x00023868, 0x456585a5]

The task experiencing an infinite loop is DELM, which is used to delete MAC addresses. When an infinite loop occurs, the mv_l2_del_addr_by_port function occupies the semaphore of MAC entries. When other tasks, for example the IPC task, need to operate MAC entries, these tasks will be suspended because no semaphore is available. However, the infinite loop cannot be broken. Subsequently, the IPC task is always suspended, resulting in the fault.
5. Analyze the reason for an infinite loop.
After a code walk-through was performed, messages notifying the deletion of MAC addresses were accumulated in the message queue when a large number of MAC entries were triggered in a short period. Due to a software processing bug, the DELM task was always reading the message queue status when the messages were accumulated. Consequently, an infinite loop occurred on the DELM task.
The infinite loop occurred because of the deletion of MAC entries. After analyzing logs, we found that S5700s often received STP TC messages from Eth-Trunk 5. After an S5700 received TC messages, it deletes MAC entries of the related interfaces.
6. conclusion
When a device was triggered to delete a large number of MAC entries, a software bug caused other tasks unable to apply for the semaphore of MAC entries. The IPC task was then suspended when applying for the semaphore, and the master switch cannot access other stack members.
7. After implement the workaround that run the stp edged-port enable command on the related ports to reduce TC messages, the issue is disappeared
8. The patch for this software bug will be released at the end of July. 2014 to resolve this issue completely.

Root Cause

1. High CPU usage
2. Stack cable problem
3. Software bug

Suggestions

When run STP on switches, configure stp edged-port on interfaces which connect to PCs and servers to avoid MAC addresses fresh frequently.

 

How much buffer we need on the VDSF board to convert VDSL2 frame?

Issue Description

Q:
The MA5606T should design some buffer for frame converting.Customer has asked us how much buffer we need on the VDSF board to accomplish this request.

Alarm Information

Null

Handling Process

A:
that the upstream buffer is 2M bytes and the downstream is 16M bytes.

Root Cause

Null

Suggestions

Null

When cannot log-in MSUITE of U2000V1R3.

Issue Description

After installation of U2000V1R3 in my laptop, the U2000 installation software would automatically create the U2000 Server, U2000 Client, U2000 system monitor as well as the U2000 MSuite function.

During training, I was sucessful in log-in all the above application such as the U2000 Server, U2000 Client and U2000 Sytem monitor funtions.

However, I can never log-in the U2000 MSuite.

Alarm Information

Log-in fail

Handling Process

Procedure to start U2000V1R3 MSUITE Application

00001. Go to C:\HWENGR\engineering

00002. Double click “msserver “

00003. Double click “ startserver “

00004. Log-in MSUITE application.

00005. Username = admin

00006. Password  = admin

 

Root Cause

I download all relevant U2000V1R3 installation & operation manuals from the Huawei support web-site.

After detail self-study & investigation, I manage to successfully log-in the MSUITE of U2000V1R3.

Suggestions

The Huawei U2000V1R3 installation & operation manual does not clearly explain the correct prodecure to start the MSUITE.

Perhaps, with my confirmed research result, please modify & improve our Huawei documentation in this aspect.

Hopefully, my knowledge sharing would help many Huawei engineers & customers as well.

 

How to change the OLT type device?

Issue Description

Q:

The system prompt by default is MA5680T instead of MA5600T, which means the device type is wrong.
In the fact of matter, MA5680T and MA5600T are kinds of the same thing, but we clarify for customer that the MA5680T just use for GPON service, so this is the difference of commercial policy at marketing.  We have ordered MA5600T but received MA5680T.
There is something wrong happened at supply of production upload, which induce us got the wrong type device.
      
      

Alarm Information

Null

Handling Process

A:
How we can change the device name on OLT.
1. log on OLT
2.Use “diagnose” and “su” mode, and Take the SCUL code (need a generated-passowrd tool, plz refer to the word in the attachment ) 
MA5680T#diagnose
MA5680T(diagnose)%%su
  Challenge:0KZQKQBT
  Please input password: (here need to use that tool I mentioned before)
3.Change device type of OLT
MA5680T(su)%%device MA5600T
MA5680T(su)%%quit

Root Cause

Null

Suggestions

With regard to the details of operation, which how to change the type of OLT, plz check the document in the attachment.

 

How many Remote frame ( RSU with HABD ) can UA5000-PVM board support?

Issue Description

Q:
In a Customer meeting customer ask about us how many  RT (remote frame RSU_HABD) we can add to COT( PVMB_HABD).

Alarm Information

Failure: Resource of HW error

Handling Process

A:
We have to confirm the limitation of the Remote frame ( RSU with HABD frame) per PVM board ( Main HABD with PVMB).
We check the document,no related information found then we try to add the frame manually in the system.
We can add the 32 remote frame successfully but when adding the next frame ie.33 it prompt one error message that HW-resource error.
After doing the above operation we came to know that PVMB supports on 32 remote frames.

Root Cause

We have to confirm the value of the Remote frame ( RSU with HABD frame) we try to search the same in support site and document but it was not present there.
Then we try to check it in the system because we have to confirm the value ASAP.

Suggestions

PVMB ( PVMB with master frame HABD)supports on 32 Remote frame ( RSU with HABD Frame).

What’s the fault S5700, and all multicast traffic is being forwarded?

Issue Description

S5700-X-LI-28P using software V200R003C00SPC300.

Traffic belonging to multiple multicast source are forwarded to all host, even if the
”display igmp-snooping port-info” command didn’t show any entry.

Alarm Information

We don’t have to configure ”multicast-vlan enable” and ”multicast-vlan user-vlan <vlan-id>” to get multicast to an interface where customer-vlan and multicast-vlan is added simultaneously and I can’t see any groups beeing joined using ”display igmp-snooping port-info” (almost seems like igmp-snooping isn’t working).

Handling Process

To  discard unknown multicast packets in VLAN 10, we can configure like this.
<HUAWEI> system-view
[HUAWEI] vlan 10
[HUAWEI-vlan10] multicast drop-unknown

Root Cause

By default switch broadcast all unknown mcast packets, this could explain why you get multicast traffic but no entry under “display igmp snooping port-info”
Unknown multicast flows are multicast data flows that match no entry in the multicast forwarding table. By default, the switch broadcasts unknown multicast flows in the corresponding VLAN. we can use the multicast drop-unknowncommand to configure the switch to discard unknown multicast flows, which reduces instant bandwidth usage compared with the broadcast mode.

Suggestions