본문 바로가기
OS-OE Knowledge/Linux-Unix KB

RHEL/general protection rip: rsp: error:0, segfault at rip rsp error 6 or 4

by 스쳐가는인연 2016. 2. 12.

하드웨어 점검 시 특이사항이 없는데, OS상에서 특정 프로그램 관련 General Protection 오류가 기록되는 경우


  Memory (16 DIMM(s) for 64 GB total memory)
    Component         Size       Cfg Spd      Max Spd      C\Err  UC\Err  Smart  Technology   Vendor
    -------------------------------------------------------------------------------------------------------------------
    BOARD 1 DIMM 4    4.1 GB     1067 MT/s    1866 MT/s    0       0      Yes     RDIMM       Micron                  
    BOARD 1 DIMM 9    4.1 GB     1067 MT/s    1866 MT/s    0       0      Yes     RDIMM       Micron                  
    BOARD 2 DIMM 4    4.1 GB     1067 MT/s    1866 MT/s    0       0      Yes     RDIMM       Micron                  
    BOARD 2 DIMM 9    4.1 GB     1067 MT/s    1866 MT/s    0       0      Yes     RDIMM       Micron                  
    BOARD 3 DIMM 4    4.1 GB     1067 MT/s    1866 MT/s    0       0      Yes     RDIMM       Micron                  
    BOARD 3 DIMM 9    4.1 GB     1067 MT/s    1866 MT/s    0       0      Yes     RDIMM       Micron                  
    BOARD 4 DIMM 4    4.1 GB     1067 MT/s    1866 MT/s    0       0      Yes     RDIMM       Micron                  
    BOARD 4 DIMM 9    4.1 GB     1067 MT/s    1866 MT/s    0       0      Yes     RDIMM       Micron                   
    BOARD 5 DIMM 4    4.1 GB     1067 MT/s    1866 MT/s    0       0      Yes     RDIMM       Micron                  
    BOARD 5 DIMM 9    4.1 GB     1067 MT/s    1866 MT/s    0       0      Yes     RDIMM       Micron                  
    BOARD 6 DIMM 4    4.1 GB     1067 MT/s    1866 MT/s    0       0      Yes     RDIMM       Micron                  
    BOARD 6 DIMM 9    4.1 GB     1067 MT/s    1866 MT/s    0       0      Yes     RDIMM       Micron                  
    BOARD 7 DIMM 4    4.1 GB     1067 MT/s    1866 MT/s    0       0      Yes     RDIMM       Micron                  
    BOARD 7 DIMM 9    4.1 GB     1067 MT/s    1866 MT/s    0       0      Yes     RDIMM       Micron                  
    BOARD 8 DIMM 4    4.1 GB     1067 MT/s    1866 MT/s    0       0      Yes     RDIMM       Micron                  
    BOARD 8 DIMM 9    4.1 GB     1067 MT/s    1866 MT/s    0       0      Yes     RDIMM       Micron 


OS : Red Hat Enterprise Linux Server release 5.10 (Tikanga)
Kernel : 2.6.18-371.el5


kernel: App[] general protection rip: rsp: error:0
kernel: App[]: segfault at  rip  rsp  error 6
kernel: App[]: segfault at  rip  rsp  error 4


What are those "general protection" errors logged to /var/log/messages on my x86_64 system?
https://access.redhat.com/solutions/2336


Resolution
The message indicates that the program called someprogram has attempted to access a non-canonical address. This is usually caused by a bad handling in the program which results in an incorrectly set pointer. This is very analogous to segmentation fault situations.

The only way to avoid this is by fixing the code in the program.


Action Plan 1.
What: System 재조립
When: ASAP, Memory 오류가 의심되는 경우.
To do.
1) Memory DIMMs 일괄 재장착
2) PCI 확장카드 일괄 재장착
3) BIOS구성 백업 및 NVRAM 초기화(System Maintenace switch #6 이용
4) SPP 2015.10으로 부팅하여, CPU 및 메모리만 선택 후 진단 툴 연속 30회 수행 << 메모리 테스트
5) SPP 2015.02 또는 IP 2.20 으로 부팅하여, CPU 및 메모리만 선택 후 진툴 연속 10회 수행 << 진단 엔진 변경 후 메모리 테스트


Action Plan 2.
What: LX OS 부팅 후 General Protection 기록된 Appz 관련 모듈/서비스 비활성화/비구동 및 유사 이벤트 발생 여부 확인
When: A/P 1 수행 후

반응형