I checked all combinations of HIB host-adapter cards and found some pairs working well on the test environment.
Test was done with V2 computer which is running as PXE boot, IO chassis (S1706947) and 3m-long copper HIB cable. MCF issue is that not only RCG but also OS cannot find PCIe cards on IO chassis. So test was done as checking PCIe card (Contec BIO1616) can be found by the 'lspci' command.
Because I checked multiple host cards and adapter cards, compatibility between host and adapter cards seems to be reliable. On the other hand, I used only one IO chassis and short copper cable. If there is some compatibility issue with main board of IO chassis and cable length, results below might not be reproduced in the mine environment. But as my experiences, combinations which didn't work well with short copper cable is always didn't work with long optical cables. So results below should help to narrow available combinations. Compatibility table between the serial numbers of adapter cards and host cards is as follows.
Unfortunately, there appears to be no such law as a simple revision dependency. Also, the two adapter cards in the bottom row were not recognized by any of the host cards, so it seems likely that they are malfunctioning. Yesterday, I used host card of S/N=120018 and adapter cards of S/N=197215, 201053, and 197223 which were labeled "OK" (I'm not sure who and when checked it and hot to check it okey.). But any combination doesn't work also on the test stand.
|---------------+--------+--------+------------+------------+------------+------------+-------|
| A\H | 120018 | 120016 | QS13491142 | QS13491113 | QS13491022 | QS13491183 | HIB35 |
|---------------+--------+--------+------------+------------+------------+------------+-------|
| 120041 (none) | o | o | o | o | o | o | o |
| 120319 (none) | o | o | o | o | o | o | o |
| 120043 (none) | x | o | o | o | o | o | o |
| 119895 (none) | x | x | o | o | o | o | o |
|---------------+--------+--------+------------+------------+------------+------------+-------|
| 197215 (B1) | x | o | o | o | o | o | o |
| 197224 (B1) | x | o | o | o | o | o | o |
|---------------+--------+--------+------------+------------+------------+------------+-------|
| 201053 (B2) | x | o | o | o | o | o | o |
| 197223 (B2) | x | o | o | o | o | o | o |
|---------------+--------+--------+------------+------------+------------+------------+-------|
| 197219 (B1) | x | x | x | x | x | x | x |
| 201050 (B2) | x | x | x | x | x | x | x |
|---------------+--------+--------+------------+------------+------------+------------+-------|
YamaT-san (Remote), Washimi-san, Yokozawa-san, Nakagaki-san, Ikeda
We have performed recovery work on K1MCF0.
First, We tried turning on the RTPC, and it booted up normally.
The key difference from the previous K-Log #33397 is as follows:
We turned off the amplifier (KM750) before powering on the RTPC.
Status before starting work:
IO chassis power was ON.
RTPC was powered OFF, and both the HIB cable and USB keyboard were disconnected.
The PEM amplifier power was ON.
Work was carried out in the following steps:
1. Requested the PEM team to check the cable connections, and no issues were found.
2. The PEM team turned off the amplifier (KM750).
3. Reconnected the cables that had been removed during the previous test:
Connected the RTPC's HIB cable and USB keyboard.
4. Turned on the RTPC power.
5. After startup, confirmed via dmesg that the RTPC recognized cards such as ADC/DAC.
6. Turned the amplifier power back on (which was turned off in step 2).
7. Tidied up the shaker cables.
8. Performed an injection test using the speaker.
9. Performed an injection test using the shaker.
No issues occurred in any of the above steps.
For now, we will leave the system as it is and monitor it.
If a similar issue occurs in the future and cannot be resolved using the standard recovery procedure, we will test whether powering off the amplifier resolves the issue.
If the system recovers just by turning off the amplifier, there may be a grounding issue on the amplifier side.
If that doesn’t help, we will consider replacing the HIB card or taking other measures.