Opened 12 years ago

Closed 12 years ago

Last modified 12 years ago

#26 closed defect (invalid)

version 185 trap 0008

Reported by: Barry Landy Owned by: somebody
Priority: minor Component: basedrv
Version: Keywords:
Cc: bl10@…

Description

I have now had 2 trap 8's in my T61 using version 185. Both times a memory stick was insrted in the USB 2 port.

The first trap screen showed corruption like an earlier one (under 184) that I posted; the second was not corruption and I will attach a screen shot.

I also noticed a possibly USB related entry in popuplog which I will also attach. I have put priority minor simply because it is not happening frequently.

Attachments (10)

popuplog.os2 (556 bytes) - added by Barry Landy 12 years ago.
Image0062a.jpg (114.2 KB) - added by Barry Landy 12 years ago.
screenshot of trap 0008
brandy.zip (232.6 KB) - added by Lars Erdmann 12 years ago.
USBEHCD: readd mutex in AccNonIsoReq?
brandy.2.zip (232.9 KB) - added by Lars Erdmann 12 years ago.
USBEHCD: readd mutex in AccNonIsoReq?, misc. changes
pci (72.6 KB) - added by Barry Landy 12 years ago.
brandy.3.zip (232.9 KB) - added by Lars Erdmann 12 years ago.
USBEHCD: reverting various TD init stuff etc.
brandy.4.zip (232.8 KB) - added by Lars Erdmann 12 years ago.
(only) USBEHCD (for now): change in timeout timer handling
brandy.5.zip (233.2 KB) - added by Lars Erdmann 12 years ago.
USBEHCD, "SetBits?","ResetBits?": changed to a "conservative" implementation
brandy.6.zip (233.2 KB) - added by Lars Erdmann 12 years ago.
USBEHCD, "RemovePeriodicQH": backing out freeing QH
irq (1.7 KB) - added by Barry Landy 12 years ago.

Download all attachments as: .zip

Change History (64)

Changed 12 years ago by Barry Landy

Attachment: popuplog.os2 added

Changed 12 years ago by Barry Landy

Attachment: Image0062a.jpg added

screenshot of trap 0008

Changed 12 years ago by Lars Erdmann

Attachment: brandy.zip added

USBEHCD: readd mutex in AccNonIsoReq?

comment:1 Changed 12 years ago by Lars Erdmann

Did a change to USBEHCD (and other changes to USBUHCD.SYS,USBOHCD.SYS,USBEHCD.SYS).
Please try brandy.zip.
Since this is a race condition / timing issue you will need to test for quite some time.

Last edited 12 years ago by Lars Erdmann (previous) (diff)

comment:2 Changed 12 years ago by Barry Landy

will do.

comment:3 Changed 12 years ago by Barry Landy

One more trap 0008 with corruption. I had changed to the brandy set but not rebooted.

I was using unzip from a memoy stick (USB2) to update files on HDD.

(now rebooted of course and will keep monitoring)

comment:4 Changed 12 years ago by Barry Landy

Still no trap 0008 to report

However, suspend resume no longer works.

Suspend (from the ACPI setup icon) works; but resume does not complete.

(I suspended AFTER ejecting a memory stick)

comment:5 Changed 12 years ago by Lars Erdmann

Suspend/resume is known to cause problems with ACPI.PSD. There is not much I can do about it until the problems are sorted out on ACPI.PSD side.
I can not do a suspend/resume on my machine either.

comment:6 Changed 12 years ago by Barry Landy

But suspend resume works ok on both earlier versions (184 AND 185)... ! so I assume that your fix to my trap 0008 did something to break resume?

comment:7 Changed 12 years ago by Lars Erdmann

I haven't touched the code that is called on suspend/resume. That's all I can say.

Last edited 12 years ago by Lars Erdmann (previous) (diff)

comment:8 Changed 12 years ago by Barry Landy

just a suggestion. Your last change was (I believe) to add a mutex. Could it be left locked on suspend/resume?

comment:9 Changed 12 years ago by Barry Landy

Well well.

Booted OS2; did not use USB; (used CD instead); closed the lid expecting nothing to happen but it suspended. I tried resume and it worked.

Will try the opposite experiment tomorrow but that does make it look as though using a USB stick changes the suspend resume behaviour.

comment:10 Changed 12 years ago by Barry Landy

later: after inserting a memory stick and writing to it, then ejecting it. lid close no longer suspended. However resume still works fine.

comment:11 Changed 12 years ago by Barry Landy

One further trap 0008 using brandy.zip

comment:12 Changed 12 years ago by Barry Landy

and another of the corruption type

Changed 12 years ago by Lars Erdmann

Attachment: brandy.2.zip added

USBEHCD: readd mutex in AccNonIsoReq?, misc. changes

comment:13 Changed 12 years ago by Lars Erdmann

Please try brandy2.zip.
Also please add output of "pci.exe -D" so that I know your system components.
Is this a multi-core system ?

comment:14 Changed 12 years ago by Lars Erdmann

Can you run 10.183 for some time ? I am not yet convinced that the trap 8 stems from the USB drivers.
Please run 10.183 for a couple of days and try to provoke the trap. It it occurs with 10.183 we know that the problem must be somewhere else.
As a second test, use the 10.185 package and only replace USBD.SYS with the one from the 10.183 package.

comment:15 Changed 12 years ago by Barry Landy

Was out of touch for a couple of days. I am not sure what you would like me to do first....

re the question: it is a coreduo system but I am running with maxcpu=1 as with both cores active winos2 doesnt work.

Changed 12 years ago by Barry Landy

Attachment: pci added

comment:16 Changed 12 years ago by Barry Landy

output of pci -d attached

comment:17 Changed 12 years ago by Barry Landy

brandy.2 also failed (I got a trap 0008 first time of trying)

version 183 worked (one try).

I have noticed that the CPU meter (either with one or two cores) shows 99% most of the time for no obvious reason (both 185/185a/185b (the brandy versions) and 183)

comment:18 Changed 12 years ago by Barry Landy

1) version 183 seems to work repeatedly

2) the 100% CPU is probably Firefox (so not relevant here)

comment:19 Changed 12 years ago by Lars Erdmann

You will need to test 10.183 for at least a week of constant use. One day is too short to say for sure.

Also, what version of ACPI.PSD are you using ?
You might want to grab the newest version which is 3.21.02 (as always you will find it at the Mensys website via your "Software Choice" download opition).
Version 3.21.01 had fixed a problem where PCI devices where generating System Errors (where they should not). That could very well also lead to trap 8.

Last edited 12 years ago by Lars Erdmann (previous) (diff)

comment:20 Changed 12 years ago by Barry Landy

I am indeed using 3.21.02. Will continue to test 183. However when I switched 185a/185b was trapping every time of trying sooner or later.

Changed 12 years ago by Lars Erdmann

Attachment: brandy.3.zip added

USBEHCD: reverting various TD init stuff etc.

comment:21 Changed 12 years ago by Lars Erdmann

Try brandy.3.zip in between. I have reverted some stuff to 10.183 level.
If that works (after at least one week of permanent use) I would like to change one other thing.

comment:22 Changed 12 years ago by Lars Erdmann

Barry, can you please add a valid email address to the CC field so that you can get email notifications ?

Changed 12 years ago by Lars Erdmann

Attachment: brandy.4.zip added

(only) USBEHCD (for now): change in timeout timer handling

comment:23 Changed 12 years ago by Lars Erdmann

Try brandy.4.zip.

Please let me know if you get a trap 3. In that case, please post the trap screen.

comment:24 Changed 12 years ago by Barry Landy

Cc: Barry Landy added

comment:25 Changed 12 years ago by Barry Landy

I have added the cc

Sorry for the delay : was travelling

I have now tried brandy.4 was seems good on the first trial (and no trap 3)

comment:26 Changed 12 years ago by Lars Erdmann

You have to add a valid email address as CC and NOT your user name (unless you tell me that you successfully received this notification) !

Anyway, if brandy.4 is working ok in the next couple of days I want to add back in some things. Please let me know in a couple of days.

comment:27 Changed 12 years ago by Lars Erdmann

By the way: do you have Theseus installed ? If not get it from here:
http://www.os2site.com/sw/util/memory/thes4001.exe

Once you have installed it (you will just need to unzip the files contents) let me know.
I will then give you some instructions of what you need to do to examine the trap8.

One question in advance: are you using the IBM mouse driver or AMOUSE ? I guess you are using the IBM mouse driver, correct ?

Last edited 12 years ago by Lars Erdmann (previous) (diff)

comment:28 Changed 12 years ago by Lars Erdmann

By the way: had you freshly updated ACPI.PSD when the trap8 started to show up ?

comment:29 Changed 12 years ago by Barry Landy

I did not see an option to add an email address but wll look harder.

I do not have Theseus installed but will do so.

I believe it is the IBM mouse but I have gtried both in the past (in connection with getting the mouse to work after resume).

I dont THINK I had freshly updated ACPI when the trap 8 started to show up but it is quite a long time ago when I had the first one

comment:30 Changed 12 years ago by Barry Landy

1) Please point to an option to add email as a cc 2) OS2site needs a login which I dont have so that's something else to investigate.

comment:31 Changed 12 years ago by Lars Erdmann

The CC field is a text entry field. You can enter anything you want. If you give me an email address I will add it for you.

Did you get the email notifications or not ?

Changed 12 years ago by Lars Erdmann

Attachment: brandy.5.zip added

USBEHCD, "SetBits?","ResetBits?": changed to a "conservative" implementation

comment:32 Changed 12 years ago by Lars Erdmann

Forget brandy.4.zip. Go for brandy.5.zip.

I am starting to believe that you might have problems with the new ACPI.PSD version.

Last edited 12 years ago by Lars Erdmann (previous) (diff)

comment:33 Changed 12 years ago by Barry Landy

OK will try brandy.5 (but brandy.4 seemed good). The OS2SITE says "ask around" to find out how to get a signon; can you tell me?

I cant see a text entry for cc only a check box : I did add my email to preferences just now will that help?

If not: bl10@…

Changed 12 years ago by Lars Erdmann

Attachment: brandy.6.zip added

USBEHCD, "RemovePeriodicQH": backing out freeing QH

comment:34 Changed 12 years ago by Lars Erdmann

Forget about brandy.5.zip. Go for brandy.6.zip

comment:35 Changed 12 years ago by Lars Erdmann

Cc: bl10@… added; Barry Landy removed

comment:36 Changed 12 years ago by Lars Erdmann

You should now get notifications.

Yes, maybe it helps to add your email address to the preferences. I have not set up this TRAC system.

comment:37 Changed 12 years ago by Barry Landy

Now getting notifications. Maybe it was the preferences.

optimistically trying to do more than one thing at once I had traps with both brandy.4 and .5 (using WIFI in parallel with the USB device).

Will try .6 (quite tedious because of all the reboots!! post traps)

comment:38 Changed 12 years ago by Barry Landy

OK. One successful try of brandy.6 : will keep trying.

comment:39 Changed 12 years ago by Lars Erdmann

I am beginning to believe that you have more than just a USB problem.
You should check which devices share IRQs with the various USB HCs.
Additionally you should consider backlevelling ACPI.PSD to some older version and observe if that gets you rid of the traps. If yes, you will then need to address this problem to the ACPI bug tracker.

Changed 12 years ago by Barry Landy

Attachment: irq added

comment:40 Changed 12 years ago by Barry Landy

I have attached the output of rmview /irq. The sharing of 20 between e1000e (wifi) and UHCD seems suspicious. I will also try backlevelling ACPI to 3.20.03

comment:41 Changed 12 years ago by Barry Landy

1) It is quite plain that the WIFI and the USB interfere with each other (eg, before the system died locked up I got a "USB device not ejected cleanly" (cant remember the exact words) when the device had not been touched bit data was going down the WIFI route. This I assume is an ACPI problem? the old version 3.20.03 was no different.

2) I think brandy.6 is OK so long as I dont use the WIFI.....

comment:42 Changed 12 years ago by Barry Landy

David Azarewich suggested that the E or X centre USB widget might be responsible for the problems observed Friday. I remover the widget and that indeed turns out to be the case.

Using the latest brandy.6 remains OK

Would you like me to revert to the original 185?

comment:43 Changed 12 years ago by Lars Erdmann

No, don't bother to revert back to 10.185. Just wait for 10.186. Effectively I have now backlevelled most of the things that I changed starting from 10.185.
At least you now know that you can stick to the most recent ACPI version.

Let me know if I can close this ticket.

Last edited 12 years ago by Lars Erdmann (previous) (diff)

comment:44 Changed 12 years ago by Lars Erdmann

Resolution: invalid
Status: newclosed

comment:45 Changed 12 years ago by Lars Erdmann

Removing the USB widget and instead using the daemon program usbmsdd.exe fixed the problem.
There was no problem with the USB HC drivers.

comment:46 Changed 12 years ago by Barry Landy

Sorry could not reply earlier (Jewish New Year). Quite happy that you closed the ticket.

comment:47 Changed 12 years ago by Lars Erdmann

What driver are you using for your WiFI device ? Is it fairly new ? If yes you should raise a bug against eComStation so that someone can investigate if that WiFI driver works ok or not.

Or was it the "e1000e_0 Intel 82566MM Network Adapter" rather than the WiFI device ?

Last edited 12 years ago by Lars Erdmann (previous) (diff)

comment:48 Changed 12 years ago by Barry Landy

Its a Genmac driver GENM32W and not new. The e1000e is the wired ethernet port.

WIFI seems to work OK if I dont have the USB widget.

comment:49 Changed 12 years ago by Barry Landy

Version 186 seems to work well for me (without the USB widget!). If I get problems I will open a new ticket.

comment:50 Changed 12 years ago by Lars Erdmann

One last thing:

1) please install Theseus. See comment #27
2) start Theseus, select System->General System->Device Drivers
3) search for the line that reports "01F38" as its data segment limit (second column)
4) tell me which driver (name) it is (fifth column). If you should find more then one with that data segment limit, tell me all of them (you can search in the window with Ctrl-F / Ctrl-A)

Thanks !

comment:51 Changed 12 years ago by Barry Landy

Installed Theseus 4 (thanks for sorting out the need to login to get it)

I found no drivers ending at that address.

comment:52 Changed 12 years ago by Barry Landy

After spending a week trying to recreate the conditions that caused a trap (after adding a dump partition) I have succeeded by formatting the memory stick JFS (HPFS/FAT32 do not seem to provoke this problem).

Currently with version 185 of the USB software and with the USB Widget in place.

This is a non corruption dump and I can attach a mobile phone pic of the trap. CS:EIP is 07c0:000009b7

Who would like me to do what with the 2Gb dump I hope was taken?

comment:53 Changed 12 years ago by Barry Landy

To catch up with stuff posted mainly to the newsgroup.

When trying with version 185 and the USB widget in place to provoke a dump I discovered another effect.

Writing a long series of writes (86 files 86Mb) from HDD to a JFS formatted USB stick I sometimes get a trap 8 (though I have not yet managed to provoke it when dumping was set up) and sometimes I get a hang of the write process (but NOT the system). In the hang state if I manually eject the stick the system gives a trap 8.

I have a dump of that and have passed it to Stephen Levine.

I will continue trying to provoke a trap 8 from the write process as it might be a different problem for a little and then return to version 196 and see if any traps occur. If I generate a trap with 186 I will start a new ticket.

comment:54 Changed 12 years ago by Barry Landy

moments after posting that I succeeeded in dumping a trap 8 and will arrange to process it.

Note: See TracTickets for help on using tickets.