Email: Password: Remember Me | Create Account (Free)

Back to Subject List

Old thread has been locked -- no new posts accepted in this thread
???
02/23/12 13:27
Read: times


 
#186158 - Mars Pathfinder
Responding to: ???'s previous message
Kai Klaas said:
Christoph said:
Famous example of a watchdog saving the day: Mars Pathfinder. It's hard to send someone to hit the reset button on a device that's located on another planet.

Are you sure that a watchdog solved the problem? Or was it a simple clock which periodically turned-off and turned-on the power supply?


I believe in case of Mars Pathfinder, a software bug in the operating system caused a lockup, so a normal reset (and the system restarting in a safe mode) would have been sufficient. They were able to find the bug, fix it, and upload the corrected software to the rover after this incident.

Kai Klaas said:
A disadvantage of a watchdog that only provides a reset is, that it cannot recover circuitry from typical lock-up. Only a full power-down followed by a power-up can help then. So, a better watchdog would provide a power-down and power-up cycle, instead of a simple reset.


What action the watchdog should take on triggering depends on the target system. If the CPU can end up in states that can only be exited by cycling the power (the external slow clock one is one of them, but there are tiny chances that the processor might end up in boundary scan mode or fast flash programming mode if the external events are severe enough), the watchdog should do that. However, other parts of the hardware might not end up in a safe state if power is suddenly cut. That's one of the recommendations Jack Ganssles article gives - have hardware put the system in a safe state (even if that safe state means blowing up a rocket carrying a few hundred million Euro worth of satellites over the Pacific ... oops).

But yes, the uC in question had several documented states in which it will not even respond to an external reset. I guess the manufacturer didn't want to explicitly recommend an external watchdog that can cycle power, but I'll add it to my mental notes that even the mere mention of processor states that last until the next power cycle means that the watchdog must be able to cycle power of the uC.

Kai Klaas said:
This could be combined with a very well defined power-up Vcc slew rate, which is essential for many todays µC. They get problems if Vcc rises too slow or too fast.


True. We started with engineering samples of the uC in question, and the manufacturer kept, err, updating the documentation of the power-up sequencing. Fun. Especially since it wasn't just a simple case of keeping the slew rate where the processor wanted it, but different voltages (VDDIO, VCORE) had to follow rules like "one may not exceed the other", or "one must reach at least voltage X before the other reaches voltage Y". Sometimes I'm glad I'm only the firmware engineer and the hardware is done by people who have about infinitely more knowledge about the subject than I.

Kai Klaas said:
In such an application it would make way more sense to immunize the port lines by ESD filters, wouldn't it?


Possibly. However, in our case, I believe there are limits to the circuitry given by the requirements spec (amplitude response, input impedance, etc), and the device is explicitly not expected to continue operating through some of the more severe external events, it's just required that it does not suffer permanent damage and returns to normal operation within ten seconds.



List of 44 messages in thread
TopicAuthorDate
Pointless watchdog?            01/01/70 00:00      
   Sounds Like From What You Said...            01/01/70 00:00      
      Didn't want to point fingers.            01/01/70 00:00      
         A?            01/01/70 00:00      
            Don't forget AVR            01/01/70 00:00      
               One more argument ...            01/01/70 00:00      
                  Independently testable            01/01/70 00:00      
                     Slow but critical.            01/01/70 00:00      
                        Yes            01/01/70 00:00      
                           ESD            01/01/70 00:00      
                               You can't            01/01/70 00:00      
                                 You can...            01/01/70 00:00      
                                    But emi-filter, overvoltage protection etc may not be enough            01/01/70 00:00      
                                       ESD ...            01/01/70 00:00      
                                          It's not just ESD, it's "the works"            01/01/70 00:00      
                                             I do not trust the watchdog...            01/01/70 00:00      
                                                The simpler, the more trustworthy.            01/01/70 00:00      
                                                   Less complex does not mean less probable to fail!            01/01/70 00:00      
                                                   WDT and debug breadcrumbs            01/01/70 00:00      
                                                Lightning strikes            01/01/70 00:00      
                                                   A watchdog should never be needed - but should be there            01/01/70 00:00      
                                                   A watchdog isn't a fuse...            01/01/70 00:00      
                                                      correction            01/01/70 00:00      
                                                      I just can not agree with that reasoning            01/01/70 00:00      
                                                         Oh please, don't...            01/01/70 00:00      
                                                            so?            01/01/70 00:00      
                                                Why you need a watchdog            01/01/70 00:00      
                                                   And even if you had perfect software and hardware ...            01/01/70 00:00      
                                                      Reset versus power cycle            01/01/70 00:00      
                                                         Mars Pathfinder            01/01/70 00:00      
                                                         many modern chips            01/01/70 00:00      
                                                            The problem with bidirectional reset pins.            01/01/70 00:00      
                                                               The uC in question            01/01/70 00:00      
                                                            Latchup            01/01/70 00:00      
                                                               ONLY clocked circuits?            01/01/70 00:00      
                                                                  That kitchen exhaust fan ...            01/01/70 00:00      
                                                            link...            01/01/70 00:00      
                                                               Sales talk            01/01/70 00:00      
                                                   certified applications            01/01/70 00:00      
                  Using external watchdog            01/01/70 00:00      
                     It's a feature, actually.            01/01/70 00:00      
               Wet            01/01/70 00:00      
   Featuritis...            01/01/70 00:00      
   Looking at it from another perspective            01/01/70 00:00      

Back to Subject List